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m m m 

*r hS^AB*&> *ft£3- Afccfctf* h^o^y-f K©82&i£ 



P) HOUWcbS i?^ f ^7 'i /Pfn 'J (DMAPP) jWfiS&LTClO© 

y^-^bro »j (cpp) fr£.i&zn, w.\ci??t>m&LTCi5<Dy r^^i/Jis 

tn'J>fi( (FPP) a<£fi&$n*o FPPti, UglPPiJS^-r^CifCkoTCao 

oy5-;uy*5-;i/fo ij (ggpp) »jcggpp|51±^^lt, 

(Britton.G. , "Biosynthesis of carotenoids". Plant Pigments. London, Aca 
demi c Press, 1988, p. 133-182. (Goodwin, T.W. ed. ) 0 

&BJ#^(i. M^#&#ft&J&8B®Erwinia uredovora®* u=f- J << K 
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T*&£*'7*1r>^y££g£tf aci^aJfiiKL/;: 7 El#j80 (Misawa.N. 
, Nakagawa.M. , Kobayashi, K. , Yamano, S., I zawa, Y. , Nakamura, K. , Harashi 
ma, K. , "Elucidation of the Brwinia uredovora carotenoid biosynthetic pat 
hway by functional analysis of gene products expressed in Escherichia co 
li.", J. Bacterid., 172, p. 6704-6712, 1990. ^ Misawa, N. , Yamano, S., Ik 
enaga. H. , "Production of 0 -carotene in Zymomonas raobilis and Agrobacter 
ium tumefaciens by introduction of the biosynthesis genes from Brwinia u 
redovora.", Appl. Environ. Microbiol., 57, p. 1847-1849, 1991 
3-58786-v£fB#Ra) c 

u z ± {* x* % t z a &itm % * w&t * iKMm t l r *> & a £ m #> x t *s 

28, p. 219-227, 1990#M) o 
TX^+if^^^O^aSiLTti. SIt*7;fO?^ HSPhaffia 
iiii&HaematococcusOl&ftifeK &tf^1&&ffc&;iz tltzit&Wl 

g-TSo $Whaffia©i£«$|-ek> *©iffllS»*<3fiil"C U^7X^+t>f 
>©£gfi;0<{&^A:*!K r-£gt£o HggHaemat 

ococcus©i&«$J©l££x ig«NMC7X * ^ >£j£lC*oJ£<£#;£{fci£ L 

tut tints ^R|^W©^«60i:«fe^#^AX5t^©/ca6o«#^S^ 
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LT©££f4^7X**lf:^>©8£tt£&li©ffiT?S 

«i©WHfctt<, ^H©ii^L^ite?aif^©#& ; £-^oT^n(fs 

i/*oi>* x* >^ >©£ft«*±tf* c £ i>St L < tti* i # 

3MC«fc«9> -4"C£#&&dtM Erwinia uredovora ft*&Rft3nTV£ e 
Ltt#&, 7***1f>f->*£jfc*S©K:#Stty h*i*A***3- K"f 5 
iftfi?^©K»{*, MiEL/'cJ:9^7X^+-9->5 L >©S^±©Wffltt©i»:<c:s 

t^iV»©*<att7?*So ;:©«H£LTM^ * haSj*Al***©rtn*/-f K£ 

»W©M* 
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ftktt S C <!: t> i £ -T * o 

Sfcgfllw. **Wtts ^I20^r h£&A&$£3- K-r*«fe^*»ALte* 

*&HJJ#!£te, ±I5S$&£$&t^<ft&lfcftLA:*S&> H&Haematococcus 
pluvialis ©cDNAA^y hg$A#££3- Flf- >^ L> 
^itfc? ^ ^-DNA £M U ? * - D N A £*M K# 

-SCiKfifegjU *!6Bfl^^$-li:5lc5ofco -Ttt*^ #lfigjtt* 0-^* 
^rWtStf'J^:^ K£=J- K-r<S&&B2?'J££trDNA£i§(fc-f So £ bic, 

mfcoDN A*mALtz®.±vit>m®-rz>o ttzwu^ mod 
*mw<o'r b&mA&%it. sits®® # - 4 * ; >m 



PCT/JP95/01640 

WO 96/06172 r 

Z7ij®&mi tit. mn&omn^i. Eyj#-t2> ttz&. e^j#^3 
ic^^tis? s y»EW«>fflu 0-4*; e - * * 

®E?<J#^1> B£?'J#^2, *fct*. E?J#^3lc*$n*7^6&E?'jm>< 
ojWco^Tfcfc> ftj&p«©SE**<*oT*>J:^7 5y»E3WJ«r«5|c-rSc 

EJWOEWWK E?»J#^2, **:l*E?iJS^3l;:*3*lS7^ 
yg&E?ij©mi #g©7 J JWl (Met) a<fcfcLT^S kOtti'kS^SttSo 

-ecHftftttW© 1 o<h LT. -lf7*-9->^V^S©<i: LXA-*r VIST**** 
y *mX7 7, 9 fctZZttfXZZ (^80#RS) o + 

? ^yyf- y&tf7X fir-? >?■ fr&X'&Zo 
2. y hg$A»£iIfc? (bkt) 

0-4*; Z>{t&®® $ - 4 * J y*4> 4 VL<D * * Is * hSK 

z>mmm&*ii\-t k* = - K««usE*j**trDNAT* 

So COHSWttWix jft&Haematococcus pluvial is (NIES-144)«fc *) * p--=- 
S bk titter* 9 > :htt»«C*lH©A*»&D*T07;;i 
E?'J (E^J^©E?'J#^1 K*$n*7 J ./B&ETlJ) > i2E©B^bDJTO 
7i/&E?'J (E?tJ£©E?iJ#^2IC*3ft*7 ^ yKE?'J) x Ztztl&zm® 
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k K^j#-t2, *s «t zmmtt 3 K*$n*7$ yKse^j^ 3- Kir * g&E 

?ti> E9iJjfe©E?iJ*^ 4 „ 5,6. fcitM fc*$ft«5£3£Efll*£frfe©© 

fit©**!/:/***- hgM£&t-*»$£tt£WLT:fc!9. *SHJfe«l^lc*>^ 

($ 8 . ttzs BKT(i> 3 -h Kd*->- 0--f *V >M<D 

fcitfDNAi fe£{*tttt**nS>-liWLT^ttl«V, 0 - >T * y 

Brwinia©^7D^y-r K£fi£>tfciP8k crtE, crtB, 
crtl RtfcrtVZm^ZCtlZ&lO. *»»$'©»£&£ 0 - >*kmik* 
C HK±IE© 4 o©iteTJCjnx.crtZ3liE^fcfflV>S C iJC«t 

© crt«fc^#U:«fc')ifcte3tlS©T'> Brwinia©crt 
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S^©«&£^KMK*f&BJ?©DNA (bktitfc^) ZmAirZt, 

©£-? y/h+lf^vS-gftiLT. ±f2©±gft3»«&©fliH^ £ - ? U 
^■TS)g?Sfc?¥4-rS<i:%X.t>n> C©W*-ttttf»UI*&©M<!:LT. 3*- 

* 

3. DNA©5?i# 

C©ffc5*£lfci£ J: Haematococcus (tt^Wtt l>©(CHaematoc 

occus pluvial is -^Haematococcus lacustris^30<££) £>mRNA£EXf# ^ 

ft&tf)};:^ Haematococcus pluvialis ©£R N A£#llt U t'J^fyn 
-dT3 OX-A- (^Sig (#) ) £ffl^T*DA + RNA^mJfit-So 
#'JA + RNA^SMcLTs i$!(fc2?g*£SUPBRSCRIPT RT (GIBCO BRL)T?*gffilS 
DNA^ML> J^TE. coli DNA 'J#--tr\ E. coli DNA ^^5-^ E. 
coli DNA RNase H (£TGIBC0 BRl)£ffl^T 2 c D N A££j&t So 
L/cc DNA£*Jiffiffl2SSI'<* ^-pSPORTl (GIBCO BRDlCjSa^ii*, cDNA 
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<k-?Z>*mm (±IBL/cBrwinia ©crt ABTUZRW-? Zizmm) ZWgfcte 

A^ii^^ ^-pBluescript II SK+:foJ:r/pBluescript II KSt(STRATAGBNE) 
4. bktift&^K'W :/ 'J ^ S D N A 

g*«T(i*<5jo>'Piegl&£#l«DPhaffia rhodozyma *>7* 9 yf- >&<D>r h 
•hVf-J < Y^ffcthZ. tt)<n&2tlT^Z (Johnson. B.A. and An, G. -Hwan, 
"Astaxanthin from microbial sources", Critical Reviews in Biotechnology, 
11, 297-326, 1991) 0 ffi& LfcHaematococcus pluvial is NIBS-144©bktitfc 

* 

Haematococcus ©tfj^&N Haematococcus pluvial is NIBS-144<t (i-tCDSHktt-^ 
ftlc^t" &$k&M<D&t£& 2m, ttet>*>. Haematococcus lacustris UTEX 294 
(The Culture Collection of Algae at the University of Texas at Austin^ 
, Haematococcus lacustris C-392 [Jj£j**#&ffl$c£#Jffl£f?T (M 

Ctlt>©?{!!fe#DNA£!BI!IU Haematococcus pluvial is NIBS-144©bkt:»f5 

^^(D^WM'O, bkt©yD-y(±2a3SO^gSHaematococcus CDl>-ftt®&fe 
frDN Aiz&X-f ZftfeVDN A\Kfr\zt>&< 7 >) ^-TXLfco ftoT*& 
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*%W<DDNA*ft*:&& :: Pt LXM^Umm ±mm, Zymomonas 

mobilis. Agrobacterium tumefaciens) (Saccharomyces cerevisiae) ^ 

tlTV^fc©^ "Vectors for cloning genes", Methods in Enzymology, 

216. p. 469-631, 1992, Academic Press> "Other bacterial systems", 

Methods in Enzymology, 204, p. 305-636, 1991, Academic Press #RS) icig 

(fc Sambrook, J., Fritsch, B. F. , Maniatis. T., "Molecular clon 

ing -A laboratory manual." Cold Spring Harbor Laboratory Press, % 1 
7 4H~m8 4 1, 1989 #113) o *!l§ffi-£ft#ilfc3 1 £&3a£ -li:S/ctolc{i. 
?!t&lCt£U ifztfLVL^ fij&© "Molecular cloning -A laboratory manual, 
1 7tf 3 4 1 H" ) o 

#180 . lac ©7D*-*-£W***li8f§^?*-Ktt*ii£ 

HiALfcfco^IgCiAtsai'o #ISW#3?ti> lac O^Dt-^ 
-^^W-r-SAHSffl cDNAfg^?*- pSPORTl (GIBCO BRL *±) lac 
©■/a* — ? — ij — Kx;U-£§:tf Softie N Haematococcus <Dbkt afi 

fi^jfAU dn^^P6Kc^ALfCo 

< B^O&e^A > 

Saccharomyces cerevisiae ^•<Dfl.#ifi£ : jP©3>A&<t LTtix Uf-^A 
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K ^ CPD f©yD*-^-fc<t^-;*-^-^ffll^ fl.#itfc^£C© 
tz%Mii-kv r-£ffigU Z.<bW&i3*v h£> S. cerevisiae (D^9 

yrp^ (mm&&fc<ow$&nt:&.m&&ttz&nmv>i> : f-3t'-'< 

9 9-) . YBp£ (#«®2#m DNA©£$8jtg&£ftoa^ffl-7;P^3 

9 JuV-i 9 -VU B*SSik^ABCi/'J-X rftB4SOfc 

tfxDitfc^I^J SJJ&fcJEfK fc.fcfA Yamano, S., Ishii, T., Nakagawa, M. , 

Ikenaga, H. , Misawa, N. , "Metabolic engineering for production of 0-car 
otene and lycopene in Saccharomyces cerevisiae". Biosci. Biotech. Bioc 
hem., 58, P. 1112-1114, 1994 #J80 „ 

< Zymomonas mobi 1 i's^OitfisT-^A > 

Zymomonas mobi lis ^Oft&itlK^O&Alts ^7^1 
&glc&iItt&££&&IC«fc9lT9C ta<T£So Zymomonas mobilisT^it 
BTZftrnZitZtztilztt, tt*itfc : ?£#ALfc&$i / <* 9- Z 
ymomonas mobilis ffi'<9 9 — pZA22) £ Zymomonas mobil isMsSA^S t 

rzymomonas lfflS©OTWaJ x B*&%it¥&U. 63. p. 1016-1 
018, 1989> fc<fcO\ Misawa, N. , Yamano, S., Ikenaga, H. , "Production of fa- 
carotene in Zymomonas mobilis and Agrobacterium tumefaciens by introduc 
tion of the biosynthesis genes from Brwinia uredovora". Appl. Environ. 
Microbiol., 57, p. 1847-1849, 1991#RS) „ 
< Agrobacterium tumefaciens^03l6K : F^A> 

fe®]&il%im& Agrobacterium tumefaciens ^(Dft&M&^O&Ate^ 951* 
l£i4ei£#i§te£&fiii&K<fcOfT9 CtfrT-ZZo Agrobacterium tumefacie 
ns X<ft&i&&=F-Z&mZlt2>tz)6\^ H&A&ttl* ALtz^m.^9 9- 
(fc<h*.tJ\ Agrobacterium tumefaciens W<9 9 -PBI121) £ Agrobacterium 

l o 
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tumefacienslcgA1"S<!: <U> (Misawa, N.. Yamano, S., Ikenaga, H., "Prod 
uction of 0 -carotene in Zymomonas mobilis and Agrobacterium tumefaciens 

by introduction of the biosynthesis genes from Brwinia uredovora". Ap 
pi. Environ. Microbiol., 57, p. 1847-1849, 1991 #J8D 0 
6. ®L%MK&Z,>r Viio*JJ Y&.m (bkt itfc?IISt) 

g|Haematococcusi*©TX^+-b->^>^iSi6<!:-rs^ Yiiv=f-J 4 K&Jifcit 
7 rfrZi/jUVv*) (???) {i-hv + J 4 FtzlfZft<. fcX+f^W*:^ 

LWS©Tx t^XO&&®\t&*1fiKW&$M& i fatLX FPP £WLT^ 

Brwinia OiivtJJ Y&&M&m\t* 
FPP ^SSi LT> Haematococcus Obkt iftfi^*!l©SR> -fttfr"£> £- * 

;M§S/ct:rTtt<ffifaLfc?8:£$U -T^t>*.> Saccharomyces cer 
evisiae. ,/ — ,/l/££HH Zymomonas mobi 1 i s> feVa$IMB& Agrobacteriu 
m tumefaciens lz Brwinia ©crt jt&?££9AU Ctt£©ifc£W> 

St&LTt^S (Yamano, S. . Ishii. T., Nakagawa. M. , Ikenaga, H., Misawa, 
N., "Metabolic engineering for production of 0 -carotene and lycopene i 
n Saccharomyces cerevisiae". Biosci. Biotech. Biochem. , 58, p. 1112-111 
4, 1994. Misawa, N. , Yamano. S.. Ikenaga, H. , "Production of 0-caroten 
e in Zymomonas mobilis and Agrobacterium tumefaciens by introduction of 
the biosynthesis genes from Brwinia uredovora". Appl. Environ. Microbio 
1., 57, p. 1847-1849. 199K fc«fctf, #M¥3-58786^&IB#KO o 

Lfc*<oT, Brwinia &*.0-h**J4 Y^AB^Ut^mmoD N A (ft 
MtfllwtiHaematococcus a?- J 4 K-^RfcjtiK^bkt) %W.&&t>-&TW\ 
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nf->&fi£fc&g/j:Brwinia uredovora <DcrtE. crtB. crtU crtY ift 
rg^AK^itte^T** Haematococcus©bkt jtfc^&^glB^ 

i** y fc&icii, Erwinia uredovora ©crtE. crtB, crtK crtY 

mB^zsumfrZKmrnn *- mz-ii. pacyciso Kn\Ltzzf^xx 

K (til* (fx pACCAR16AcrtX) , £<fcO\ HaematococcusCDbkt itfe^^frSf 
Jt^*!!^?*- (fll*.^ pBluescript II KS+) IcjfAUc^X; K (fl) 

php5i o&i oei#h) ) vmzrjKi h'zxmm jmiod k>» 

/cli2 Y T i&i&moi&tllT' 3 0-3 7 < CO^*^#T^JW*T^« L, 

^7*1f>^y£j5£j;:£>igttErwinia uredovora ©crtE, crtB. crtU crtY, 
crtZ itfe^fcjitfy h&mA&%&B : ¥-T'&Z> HaematococcusObkt 

fflV>T7^^+^>^>. 4 -*r h-e7*-*>i->Z£.mt Ztz&KlU Erwinia 
uredovora OcrtB> crtBx crtk crtY x crtZit{E^£^tr8rK - £;*;Ji§it'<? ? 

1 2 
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-.(HAtf, PACYC184) ICjf $ K (0>Jx.{i\ pACCAR25AcrtX) > 

«fct>\ HaematococcusObkt itfc^S-S&BrfrS-AH®^? 9- (fll^tf. pBlue 
script I! KS+) IcjfALfc^X; K pHP51) K£*l» 

S (0j*.tf> JM101) K#AU *tt*. 7Vt'>'J>i^P7A7x 
^n-/l/£#t/LBJgii3:A:(i2 0~3 7 t 

<3'-t ko*->x**/ >x 3-t Ko*s/x**y 7i-3+t>f>® 

-t^+tfyf- >£j5fc(C/&ilteErwinia uredovora ©crtE N crtBx crtk crtYx 
crtZ itfcT-fc.fcU^ hgSMJJ^iSfc^T'** HaematococcusObkt Jtfe^* 

4-y h-tf7*-9->f > Ct*<T? -r-f^— +Wft 
fW^i LT> 3' -t KP*fi+*>'^ 3-t Ko*->x**y vRtF^x- 
3 4- If > f- * (i-f T*& * o 

*f£BflODNAT'*S#l8$n^bktilfe^*li^Afc'^-5X $ KpHP51 £ 

mfc**<ttLfcB»J©fc»i>©** : DH5a(pHP51) 

: PERM BP-4757 
Sft*MB : TO 6 *P 7 /§ 26 B 

$ 1 gu, Haenatococcus pluvial is NIES-144 ©yhS^AK^itfe 

? (bkt) ©msae^'j i =» - K*ti*#y^f - k©t s ysiEW^to 
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^2S|{i N mm Haematococcus pluvialis NIBS-144 ©^ hS^AB^itte 

7 (bkt) (Di&mmmt?- K**i*#y^f- k©t s ;KEW*ft, 

£3 0M\ Haematococcus pluvialis NIBS-144 h g^AH^iHc 

^ (bkt) ©^sae^iji ^- h'znz**) ^zf? vot i smmmzKto 

3S4EUix Haematococcus pluvialis NIBS-144 <D*r h&mA&%&& 
? (bkt) ^#tTDNA|S©^*fiE^J^^^o EI4V A, BfeJiO'CttMiS^ K>© 

£5 Kite, »4BHctt<EW*^o 

^7Elti. #jfc£J5fc*IIIffi Brwinia uredovora (Dijuf-J^f V tfJ 

JS8EI(i* Haematococcus pluvialis NIES-144 O^r h£$A&£it£ 
7- (bkt) c#^lfill Brwinia uredovora ©TK&g^ABI&itfc? (crtZ) 

$9Bltt % Haematococcus pluvialis NIES-144 ©>7" hg$Al*&itfc 
? (bkt) &#Mt&j&inft Brwinia uredovora <2>*K£9A!t*it£?- (crtZ) 

SfH OElli, Haematococcus pluvialis NIES-144 ©^ r-S^ABI&ifl 
fc^ (bkt) 4^ti-2fl|©y7X$ KpHP5fc«fctfpHP51 

pHP5(±pSP0RT lie, pHP51tipBluescript II KS+1C, lac ©7D*-*-© 'J 
- Kx;P-££ij*;£fllc#A£nT^*o $»JPS»3Ki3»fS15ta:{i^©ck9lc^BS 

$tlT^$nTt>-5 0 S. Sail; Ss, SstI; P, PstI; Sp, SphI; N, Notl; X, Xb 
al: K. Kpnl; Sa, Sad. 

SB1 \ mU Haematococcus pluvialis NIBS-144 <D4r hJEVAlMHt 
fc? (bkt) ©M^3 K>££0$#©£gE?iJ£&«-rl'-i/3 K 
©KWtt4^1-o 

%\ 2E1I4. Haematococcus pluvialis NIES-144 ©bktitfe?©l. 7kb 
DNAKfK-^^o-yi L/c. 3 MS© Haematococcusicft-f y##r (H 
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m 

U— >1~3 : Haematococcus pluvial is NIES-144 
U— >4—6 : Haematococcus lacustris UTEX294 
U— >7-~9 .Haematococcus lacustris C-392 

K 4> 7 : Hindlll/BM 
l/-> 2. 5, 8 : PstI 
l/->3s 6, 9 : Xbal jlftl 

[«W1] 

itfc^&ifjcffl ^Haematococcus pluvial isti^ MEl&A Jftg • Allffltttt 
7t-7A ( Global Environmental Forum )lCg&$ftTVt£ NIBS-144 #1? 
*S 0 Haematococcus pluvial is£2MrtgH& (Bfx+^ 0.2%, ftK:M» V £ 
A 0. 12%. L-T^^^V 0.04%. ttft-r^S/^ A • 0.02%, 
fittlft*— I* • -bjMDtt 0. 001% > gut #;Uv"> A • ~7k?D^ 0.002%) £ffll\ 
20°C. 12Xrlllin/12ltpnm <20#iB/ii2-s)-Ctt4BHHgf*Lfco X. Haematococc 
us pluvialis ©7X?*1r >^>&fi££^-^-5&l;:> Haematococcus pluvial 
is NIBS-144 *McBH&£g&»£45mM, «tKm-tt • -b7kft^«:fii^»ffi450^ 
Wzt£ & £ HztaA.* 20V. ft&g. 125/zE/m' • sTiKjl2l$|ffl£ft LT. ->xHk 
^E* 1^ 2^ /"c o 

2 ] Haematococcus pluvial is©£DNA<DI882 
Haematococcus pluvialis NIBS-144 400ml <D&&igf&lzfem LX20V. it 
@j£20tfB/V • s. S0«hrW ?Jl<l2tiffflW/128;r$1l£T'mBffligmLtZa 

fco tft*tt©l&#^lCl5ml©$tti&ffi&(0. 1 M Tris-HCl pH8. 0, 0.1 M BDTA. 

1 5 



WO 96/06172 PCI7JP95/01640 

0.25 M NaCl, 0.1 mg/ml Proteinase K )£jja*.TWLl < Jf#U ttXT^tifffl 
&ULtz&. 6000xg. 10#ISK iVT'&bLXttmtiiZWOf&^tZo ±^{-0.6 
f&SO'f V-7a'iS-fr&1IBjL* -20VX'30ftr$tiii1lLtz&.s 7500xg. 15»IHk 
AVZ'&fcLtZo ft*aLfcDNA£W%&2nil©TB$«&(10inM h >J X-HC1 pH8.0, 1 
mM EDTA)-e&fl? U ^S©7x./-;l': ? nn*;l/i*(l:l)<!:2I&LT8>l>U 
±Jf£MtbL*:o ^T80#1©5M NaCl <»:5inlOi^ / -7l/£io*T-20TT'30# 

RUftiOLftSL 12000xg. 15#F«ik ^-eia^LAio glc70%i*/-;i/TtU8 

VXLfct, &&LT0.5 mlOTEiKMdO mM Tris-HCl pH8. 0, 1 mM ED 
TA)lcfe$U 2.5/ulO10 mg/mlORNase A£i0**: fc©£Haematococcus pluvia 

[0yfe#J3] PCRJCfcSHaematococcus pluvial is ^b©crtZ^I^€l^i0^fi 

Erwinia uredovorai Erwinia herbicolacDcrtZitfK^ (Misawa, N. , Nakagawa, 
M. , Kobayashi, K. , Yamano. S. , Izawa, Y. , Nakamura, K., Harashima, K. , 
Elucidation of the Erwinia uredovora carotenoid biosynthetic pathway by 
functional analysis of gene products expressed in Escherichia col i. J. 

Bacteriol., 172, p. 6704-6712, 1990s Hundle, B.S., Beyer, P., Kleinig, H. 

, Englert, C. , Hearst, J. E. , "Carotenoids of Erwinia herbicola and an E 
scherichia coli HB101 strain carrying the Erwinia herbicola carotenoid g 
ene cluster.", Phytochem. Phytobiol., 54. p. 89-93. 1991) lea- K£ft£ 

mm&i&Ltz 0 

No. 1 5* -GGNTGGGGNTGGCAYAARTCNCAYCA-3' 

No. 2 5' -CANCGYTGRTGNACNAGNCCRTCRTG-3' 

No. 3 5* -GCRTASATRAANCCRAARCTNACRCA-3* 
(N: h,G,Ct fzitl , R: MtziiC . Y: CttzM . S : A.G ttzit T) 
No. l(tNo.25LO'No. l£No. 3©il£7*5 'f -7-^-^oTHaematococcus pluvial i 
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s 0£DNAf£#?£|lFS!<h LTPCR (polymerase chain reaction) £fro/c 0 
Wk&tfZtiZIX. #Jl00 ng©Haematococcus pluvial is ©±DNA^. &100 #M 
Oii^y^d'^-s 10 mM MgSO^ lxVent BufferUO mM KC1, 20 mM Tris-HCl ( 
PH8.8), 10 mM (NH^SO*. 2 mM MgS0<. 0.1 X Triton X-100),250 #M dNTP. 2 
U Vent DNA polymerase (New England Biolabs, Inc.) Ktt£<fc?lc&&U 9 
4°C 30#RSk 55t 30&IHK 72t 30tJ>PaT^30i^ -< ? JK Rtf94T 30tP^, 6 
0T 30&I3U 72<C M&fflX'ZQ-* J f JUOKfc&tirVPCR&mK m%.foW}&T' 

CUM 4] Haematococcus pluvial is©£RNA©IH£!{ 
Haematococcus pluvial is N1ES-144 #£800 mlOS*^ittlw*|g| LT20°(\ 

^S20/zB/m i • S > wmvj ? M2&r$w/i2&frmT~mBf8\i&mL, m^r 

6tgf£&i£g£45mM, • -b7KM^S^»S450^M(C/i:S<k 9 lwJta^> 

20°C. ?t?SSl25//E/m ! • sT)^12NrP B 1^ftLfco £g&J0^g#£*ffi U 

3 miCDISOGBN-LS ((#)-? £inx.. M-e5#RM*B U SK0.8 

ml©? n 15tJ>Pi38*L<«#LT3#Rlk ^ST'ScgL/Co 

12000xg. 15#Pb*K 4t^LT±l^«miL> 2 ml©^ 7 ^d/V -/Ufcta 

x-tio^bb. ^ST»caa. i2oooxg, io#r?k ^ts^l/:. ^ht7o%j. 

?y-;UTftl&$£'J >*Lfc&> &&LTlml©TB«ffi&(10mM h U X-HC1 (p 
H8.0), 1 mM BDTA)tC&jj?L^k©£Haematococcus pluvial is©£RNA&&<»: L 

tzo ZOfflmmx-iA »g©£RNAj0<*f S>ttfco 

* 

[HSfe^l 5 ] Haematococcus pluvial isCcDNAffcli'? 4 7? V -©M 
t^fy £X-dT30;*-'<- (^Si£ (#) ) £ffll^THaematococcus pluv 
ialisO^RNA^lmg^b^UA+RNA^ffilSSIL/Co fSSS!^r^(i> *tt©8l!SI&"Jl#© 

o tZo Z.<D$iWj&T*m4u g© # 'J A+mRNA£*»S2 L/c 0 
cDNA©fp$!Hi. U7*r-TM77X$ K>*-fA (G1BC0 BRLtt) 
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A+RNA £BH\ $iJPB»^Notl0lgiJE^J<!:15niersCD^-'J :/dT>£tt S£fi£DNA£ 
•T5-f v-i LTi5*$K^SS£SUPERSCRIPT RTT'*Slii$aDNA££j?£ U K^^TB. col 
i DNA 'J#--fef. B. col i DNA #'J> B.coli DNA RNase H£flH*T2#*§ 

cDNA££j£L/cg^ #JI66**Sall©U >#-£T4 DNA 'J #~tfT?*g££ti\ £ 
**tfM-cDNA©±^*Si&a<Sal I6B{£> # 'J A©T«sa<Not ISBftfctt S J: 9 K L tz 0 
mMfcS&fc^ffl^T, CftbcDNACD1r>f X#B£m\ 0. 7kb~3. 5kb©*Gffl©# 
UZ&lbtZo C©£®©cDNA#j28 ng£35ng©cDNAf&£'<? 9 -pSPORT I (GIBCO 
BRL tt) £ NotI £SalIT'#MfcLA:*>©<!: ; £±*-y HC&*nT^* y < f- 
i/ 3 >'<y 7 T-(50mM h UX-HC1 pH 7.6, lOmM MgC 1 2. lmM ATP, lmM DTT, 5 
X PEG 8000)RtFT4 DNA 'J iJ—V ^fflUT? 4 Y- > a >Lfco C©cDNAHSt^ 
?*-pSP0RT SallSBflL0±«Efclac7'o*-^-^fet.. *HffirtTcDNA 
^fg^^-fr-SCi^-e^-S^^^-T^So Jfcfc^'f Y-'>s >LfcDNA&&£ 
iTt^t, Molecular Cloning 2nd edition : Cold Spring Harbor Laborator 
y .1.21-1.41 a9W(Djjft\Z'&-oXmMlfc*;Mm (Escherichia coli) DH5a 

Z>&£XMtbtz&.^ Ilolecular Cloning 2nd edition : Cold Spring Harbor Lab 
oratory . 1.21-1.41 (1989)©#&l;:ftl,\ 75X5 KDNA&M Ltz 0 -e©*S£s 
0.6 ogOy^Xi KDNA*<ff£tU Cft^-Haematococcus pluvial is©cDNA?£^-7 

'J-£ LtLo 
(1) 0-rto^>g£*M©ft82 

Brwinia uredovora©crtZ£Jfl«©# a 4 K£-Ji£itfe^i*(crtX, crtE, crtY, 
crtl, crtBitfe^) Z^-fi^y Xi KpCAR16 (Misawa,N. Nakagawa.M. Kobaya 
shi. K. Yamano.S. izawa,Y. Nakamura, K. Haras hi ma, K. . "Elucidation of the 
Brwinia uredovora carotenoid biosynthetic pathway by functional analysi 
s of gene products expressed in Escherichia coli." J. Bacteriol., 172, p. 
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6704-6712. 1990, &tf4$£'*I*P3-58786-f &fB#M) ©BstBllftHb. Klenowg£&& 
m. 'J#--t?KJE£fT-*> CtlZk*), crtmfciFZ7 1/-AJ/7 «fc 9 $ 
-tir/c^. /8-7f7P^>^{C^g/«£crtE.crtY. crtl.crtBjKKT^-^tr 6. OkbcDAs 
p718(Kpn!)-EcoRl»rn-^tU»3tllL^o CCDMftZKMm*? ?-pACYC184 (ATCC 
37033 «k*)A#) ©EcoRVg&ttKtfiAU gW<ht-*75X$ K (pACCAR16Acrl 
X£<ft£) £»fc 0 C©pACCAR16AcrtX^W^"*^8i»^> ^o5A7i-3- 

(2) y rg$AB*&ite"f©X? 

K(i> Haematococcus pluvial isrt"C«0 of y^-lt^ 
j£$*l££:%;i£;rlS (Britton.C, "Biosynthesis of carotenoids". Plant P 
igments. London, Academic Press. 1988, p. 133-182. (Goodwin, T.W. ed. ) # 
m) o ^C-Zr±IB©pACCARl6AcrtX^^J#1-SAJlSJMl0l*</9-/JPf > (8 
fe) ZM&.'fr 2> C t £?'Jffl LTx CO^H@ll-±IBOcDNA^7'1'7*5 'J 

AU ^£ftA:^fifc&<*©feP3£rt:>!>^ ^ ^a£A8&>i£3 1; £:#&L/c* 
l!®£;*:5";-->:5''L/co ^h8a<£A£tt> *r Y*xi*J 4 K©*>£oT* 
5 * > * >f- *l<5 £ *»B©feHJ&* £ -ft d f- >Oflfe*> £ ii > 

*?\ Molecular Cloning 2nd edition : Cold Spring Harbor Laboratory , 1. 
21-1.41 (1989)CD;£j££/fH\ pACCAR16AcrtX£&jrrSA-HSJM101©3>t* 

C©3 > br> h-b^l mlKttLTlOO ngOcDNA^? W 7*7 'J 
AU &4:Eil©ffmSte&#K*fLTX* •J-->W» < £18*'* 

P5t#^L^ 0 75 X$ KpHPSCfllfift^^lOEItC^-ro 
[£«E0>J 7 ] >r b S^AK^itfi^O *»E'aj*J£ 

pHP51C|#A cF tlT <£ Haematococcus pluvial is£*fc<Dl. 7 kb cDNA£#]Hi£& 
SaliiXbal T'VW&L. ^Jiffi^? 9 -pBluescript II KS + fcir/pBluescrip 
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t II SK+©SalI/Xbal8B&K«»ALT> 2n®7vXi K (pHP51:fc <fctfpHP52<!: 

zmtz 0 :©^/7X; h'pmi<Dmmm%t&®£%iow\z*to phpsi 

S.CJ c pHP52ti, ZM'tl, ±fecDNA*<lac6Dya*-^~©'>l- K*-rt/-*g#S 
2f|S] &. T}%. V K ^ 2ff£] |c f$ A $ *IT tz *> O T' * Z o 

MLt^X; KpHP51> pHP52lco^TOT©¥lISTfi 
-rs*fc£M#ttSJ!£m\ *ftblw-o^T&gB5?iJ©&5££fTort:o pHP51teS 
acIiXbaliTtffSU pHP52(iKpnI <hSal I tX'ftMLtz®.. 7 * J -JV/ 9 v v 
*;i/A$fcb£m^ 3.9 J -^ttiBK<fc9DNA£lslJKLfco *tt-en©DNA£ioo 
#14)Bxolll'<9 7 T - (50mM Tris-HCl, lOOmM NaCl, 5mM MgCh, lOmM 2-/ )V 
■h? Vz-fJ -VK pH8. 0) IC&j|¥U I8OJ.-7 KDBxolII? 9 
T37^CT«&Lfco 30f4>r£lC10 #1 (DKfefeWL*-* >7 U V** LT, 10 #1 
©MB'<-y7r- (40mM NaCi. 2mM ZnCh, 10%? *) -bv-Jl, pH4.5) ©Ao/c 

LfcDNAliKlenow 9*iS5£¥*t<l:U 16^, — -Y Y— *> a L 

/c&x *8iffiDH5a£JBM<E&LA:o *f£*l*:«* ©? P- >lcol»T75X ^ 
K£H»U 7^5^ Y'<4ti/Xf-l> m) <D&1L7 ? j ■? 4 9 Jlis - 9 

&£Lfcl677 igg** (bp) ^£ttS^iJ£#4EI*i<fctft&5BI (E*J« 

D (E?iJ^© 6 ICt^) % m3El©C~D <.W3\&<OW3\&m IZtp 

^bni:ti^> ^3ElOCJ:<9Tat©^«JCo^T{i±IB©^-y>U- 
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[HISW8-] y hga|A»^iSfe^o^3 K>©&£ 

&3 K><D^itt*<*5giJfi(i. 5^ (&g&gl68~170, 189-191, 264-26 
6. 348-350, 423-425, cnbaftgti&UHtC^TftTffifrftT^So ) 
fttEtZ* J&llEWc^-rilfltes K>£*JttS168 &©&g, 189 ffiOtS, *J<k 
#264 &©igg(i, ZtlZ'tl. J&1EKDA, S&2EI0B. fc«k tflfl 3 EIO CT'* 
L tztiL B ft ^ 1 6 o * C -e«ff& ? > '< ? S i L T&g ft £ /jN^g^^-^^-r -5 

^L/:^^; K£8cgft®!Lfco S&llEllcti, tnfno^r^XJ KO 
#^£±$K5fcSS8&g£^to C*l£7-7*S K££$0I6 KieL/wpACCAR16Acr 

30. 27. 31, 37, 12©*&75X$ K£#grf **B§&Ttt, 
>©g££<ig«>t>ftfc)&<, #*|10, 6. 38T-ttIg*6btttt3&>o*:o #-^12 

i£g&g264~266<DBI?#}3 K>ATG©A£T*a<K£LT I 
ft£ftU£LfcI£<cc©ATGjmG£tt!K ^HS(iGTGT K><hSSI8L9 

C©ftg©M&=> K Y&f&frtet iT^Z £%z.hti 

ho L*ui>^T264-266©M&:J K >OT©^-^> 'J -f* U-A (J& 

( l ) fj yf-xomfe 

pmttzttpmi&0-jja*>m$L*mmiMouzmAitzb(D (*mm (pac 

CAR16AcrtX, pHP5£ /ctepHP51)) (Jgfe£M LT<^ *) £ 150 #g/in]©7> 
t'v/'J > (Ap, B^?&5S!mSS!) > 30 /*g/ml©*D7A7x-3-/U (Co, 
SI) , 1 mM©IPTG, 7 rag PeS0« • 7H t 0fc«fctf9. 3 ng Na, ♦ BDTA££fr2YW& 
(1.6% h 'J/h^ hx+X. 0.5% NaCl) 2U y h;UT, 30^, 24 
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~3o&fmm%Ltzo mmm^&mitzmt**. 300 mi or-t h > k «t t> a to 

L/Co Ctl£«fi&(£x 200 ml O^DD^W/^-^ (9/0 ^ElfiHi 
U S«SS£SL/co Cftt'MO^ao*^/^;-* (9/1) ic 

(50/1) T-®Bf}-rSC<hlc<fc*K (TLC) £ 

ffofco C©TLC !C<fctK X#* h tiRfffiO. 53. 0.78fc<fctfl ©3^*? 

^n/co «sm$n/cfe*^©75%jcis^'r4Sfcs^^fefe^(Rf (to. 53) 

y (9/0 (cftjffgu -tr7 7 "r 9 XLH-2027 5 A ? o ^7 h/77>( - (15 X 3 
00 mm) ^aaW/>^y-^ (9/1) -eKBB&tii1-SC 

#6££2 mg^f/Co *«*®*fl-«H&*'** h;K 'H-NMR, PD-MSX'** (m 
/e 564) . v"J ij ? Jl>nC<D&Wim va*Jl>J*/J 9 J-fr (50/ 

l) TS^-ZfRf fflO. 53) iJ> * XDUmSh (BASFttg!) tt*<~C— 

itz. &tnizte&zntz&%<Dio%izm%irz>fc&&% (tlc -eRffito.78) 

h;K i/'J #YA'TLC©#S&& Waa**J»// 9 (50/1) T' 

®§S"eRffllO. 78) x *>«fcy\ ?HR6//C. 8 (3.9 x 300 nun) 

X*±$!!) £ffl^fcHPLC©#»fi (7-fe HJ;U/ **J-to/l- -fv^J-fr 
(90/6/4)T'1.0 ml/ ^®»«T-©SMTRT16^) JiOi**;' > (8fi£iU*3! 8 

SflJKtttti 3 tt/c 1 5 %tCffi^-r<5Hfefe3!l (TLC T'RUI 
D&TLC fl>McD> ^ 7 —JlKj&frLtzo COMOf^ 

h/K *>«i:0\ J'<'*v9W 6(i Cm (3.9 X 300 mm) 
-XttIS!) ^HPLCO^SfrS: (7* h-HJ*/> *y-*/2-7D/V- 

* 

;p (90/6/4) T'1.0 ml/ $T©i&£T?0M0f!TRT62#) /9- a ^ 

00 (*-7H*5 >xg!> Sigma ttSS!) i^T-ifc L/c/^a6s *«5Stt*E£© 

(2) 7^^+t>f ^ 4-4r Hf7**>f->©HS 
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f 7** >T>m£.*mmZfc0 l i:olzlXft®ltZo ttit>*>. Br. uredov 
ora © &1jur; << h'&ffci&B¥®$:&t X Z K PCAR25 (Misawa. N. , 
Nakagawa, M. . Kobayashi, K. , Yamano, S. , Izawa, Y. , Nakamura, K. , Harash 
ima K. , "Elucidation of the Erwinia uredovora carotenoid biosynthetic p 
athway by functional analysis of gene products expressed in Escherichia 
coli". J. Bacterid.. 172. p. 6704-6712, 199043 <fctf#Bfl 5 P3-58786 ^§0 © 
$i]PB&& BstBII mil. Klenow fragment mm. 5 << Y- : s a >Rfc*Vs o C t 
K<fc?K crtX U-A->7 M:«t^g$t^ -fe*7*-9" >f- >g 

£K<&gfccrtB. crtB. crtU crtY, crtZ itfE^&tr 6.5 kb Asp718 (Kpnl) 
-BcoRI mXZVWKltZo *LT. ZQtirKttmmH fi- PACYC184 © E 
coRVSB&dif AU gdHit^^^i K (pACCAR25AcrtX£<ft£) 

pHP5£ fc(ipHP51^ C©-tf 7*1f >^ ygfcMISJMlOllcajA L*: {>© (*H 
ffi (pACCAR25AcrtX. pHP5£ fc(ipHP51)) (@fe£MLT^£) £ 150 #g/ml 
Ap, 30 #g/ml©Cnk 1 mM©IPTG. 7 mg FeSO, • 7H 4 Ofc J: #9. 3 mg Na 4 • BDTA£ 

$&2rr%m aexh'/yh> % l^^-xhx+x, 0.5% Naco 2U-, 

30t. 24~30I^P B ^SL/Co $S^>b$il^e^. 300 ml OTt^C 
«tt)}fitBL^o Ctt£M&, 200 ml O^ootwl/A/^ *J-)l (9/1) T 
2@}l&tfiU SSSS&SLfco ££IC, £*l£>]vft©? dd*;1/A/^ 
(9/1) ^;U^tt^©^Kffl'>'J^y;HLC^l/- r-£ffll*T. *n 

a*7bA/^ ^y-yl/ (15/1) TUBALS £ <blcJ:*K »i^7^57^ 
- (TLC) Z'n-otZo 5c©Sfefe*{is C©TLClC<fc*K £gX#», Hi. Rf it 
0.40. 0.54. 0.72©3X#y hlZfifrtltZo :nt.0fe*^ TLC7b-h^t>. 
ri^iofcgL 4>fi©? OD*;UA/y $ J-)\, (9/1) IC^ISL, t7 r f^ 
XLH-20#5 A? F^77>r - (15 X 300 mm) fcsNt. ?od^A/^ 

y-;i> (9/D T'jg^aj-rsciict*). ^©^ls^ -en-rn, $1 mg, 

1 mg. 2 mgf§/Co 

}filii£ttA:fe&£#©#j¥$H;:*§i!i?-£ Rf 0.72©fe$(i, 
hA>, 'H-NMR. FD-MSX^ (m/e 596) ©*£HJ:*K 7X^+fyf >i 
|5]-©5p®«iii^}tofc©T'*SCi*<B^^{cfj:-,/c 0 ^x^yl/jn-x 
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;U : -;U : -;U 5:5:2 fcflMPU CDX^? h Lfc 

<hC*K 3S. 3*S©Sl^«|ji^<i:-5C<i:*<^^oA:^J6. #<8|g£7X* > 
f-> CBfxtxUilil 8 El#fl9) ilil^L^o Rf 0.54©fe£(i> *©£*»•- 

bUSX^^ 'H-NMR, FD-MSX^^ Yfr (m/e 582) > If *) ii V 

C(D&W}& (9aa*sA>J»/S 9 J-fr (15/1) TCKMTRf fitO. 54) ^t>. 4 

-yht7+t>fy (#i6iUi# 8 g|#jra) ilnl^^tlfco tt*>\ Rf 0.40 

*y-;U (15/1) T*MMT'©Rf fitO. 40) . fc<fctf, J '<'< v ?HR6/zC, ,(3. 9 x 
300 mm) (-)*-^-Xlil!) WcHPLC©#«?S (74r h — h L) y*y 
-;l>/2- yo/NV-;K90/6/4)-ri.O ml/ #©&gT*©®8J-eRT6. 5 #) *<-if7 
J-y-yf-XDmmSh (BASFttS?) £"f ^T-SLfcfc*^ #$lg(i*Srt;©-fe"7 

(ffiitiUilfS 8 0#J8O t^SCi^o/Co 

^Sfe^q 9 (1) £19, Haematococcus©^ hS*A»*iie^ (bkt) 

hg$AB*3!S (/S-carotene ketolase) &3- KLTUSCtttWfrT?** (» 
8El#Ha) o C©C<b(;U o©»&BKT#/8--f *y >&©4ffcBLtM' fe©y 

Haematococcus©bktitft^l;L ±lB©JStt£l*Uw» ^7**>f >*SRi L 

*>A»£ (zeaxanthin ketolase) *a- K LTU* C t«^*>T*S UMH 
#M) o CCCilil^S^BKTj&^-K.tfS' - fc Kn + f-jS-f *V 

US, C©J;9^IKtl^o»^t)^ir*tot>nTU/j:^o^t>©-Zr*So Ltz 
*<oT> Haematococcus©^ hS^Afit*ae?bkt ti. 3(4 (3* ft) ©&g 

(ctKKS*<^*pltus Lruttu^*^*)»jtt< . 4tt (4' m ©yf-u> 

S£u£te9^ h glutei- Z 0-4 *J >£A:(i3-b h'v* t s-ff-4*; >J$ 



2 4 



WO 96/06172 PCT/JW5/01640 

*T hg^A&& (£-ionone or 3-hydroxy-£-ionone ring ketolase) K 

— tl^^i»8SBrwinia^>^^eiRhodobacterO*Df-y K£&itfe 

Brwinia© >J 3 e>3Mfc»*£fc : P"e**crtYtt »J 3 y^©*»fo*BH 
LTSI'fkt-So Lfci^T, Rhodobacter©? 4 hx^r^f-a^-Hfitfi^crt 

CfttcBrwinia©crtY£ftffi V -f o X# 1/ yicfctf 5 U a b* > <t#i§ 

o^>^^1"S (Linden, H., Misawa, N., Charoovitz, D.. Pecker, I. , Hir 
schberg, J., Sandmann, C. , "Functional complementation in Bscherichi 
a coli of different phytoene desaturase genes and analysis of accumulate 
d carotenes". Z. Naturforsch. . 46c, p. 1045-1051, 1991) o *tz, ^fSWlc 

ft* U ^7^-9->f->KUWfmtSiti>t, £-f lo^hg 
ri<#A3ft*:4 -*r H?7**>f->^rtt*n*. cn«> BKTa<gK©¥#© 

ErwiniaOcrtE. crtB. crtk crtY. crtZ 2>*l®&lt- fiu*EL*:<fc 

1 OTK&Sa^ASftfcjB-:? 'j:/h*1f>^>£tfctfi-rSC C© 
Cttt, *CKBKW?£ir-S<!:> fi -9 U 7* h >^ ^£S!S £ LT3' -t K 

ft t> ICBKWttffl LT 7 x - => *1r > f- y*kte$L-$ & C <b *<T? £ * i S C £ 
te^fc#>-efc£<!:!ifc>ft*o Haematococcus<i:&X,-ett^tfj7 
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mrnVnt LT3- 1 Ko*->x**y y+y x^+lr >^;0<&ajSnT^3 
(Andrewes, A. C. , Phaff, H. J., Starr, M. P.. "Carotenoids of Phaffia 
rhodozyma, a red-pigmen ted fermenting yeast". Phytochemistry, 15, p. 100 
3-1007, 1976) o Kl®:iJ:»)> % 8 EMC?* LfcTX* *V >^ >©±gftJlf 

m&mi 0] {fe<D*|gfcHaematococcus ©SfefeftDN Alzftt ZWsftVr 
ffeOiii&Haematococcus ©Ssfe^±JC^«| L/cbkt tftW\&*7Ftmmm h 
tlZfr&fr&tftPtLtzo £MM2 icfgLfcHaematococcus pluvialis NlBS-144^ 
£©£DN AOfflWZtWl C^T'Haematococcus lacustris UTBX 294£Haemat 
ococcus lacustris C-392 ©£DN A&H5IS2U Haema toco ecus pluvialis NIBS 
-1440±DNA£&K#J|!B»JfcHindIIU Pstl££l>(iXbaI-l:$H'fcLT7# o- 
xy^m^Sftft1r5)-«ttA:o #8IL/cDNAWrtt£0.5N NaOH . 1.511 NaCl © 

xDenhardU 5 xSSC . 0. 2% SOS, lOO^g/ml ssDNA) iCjgU 4 B#PtSk 55^7? 
^WWr'J ^-f-tf-i/a >£rf ofco &Kbktitfc^©l.7kb DNA»r)t£Meg 
aprime™ DNA labelling systems (7v> + A) «»: [ a - ' 1 P] dCTP (-110 TBq/ 
mmol) ££ffl^T&£MfcU ±Be©7 U'W 7 'J *W -t?- f a >&&lcjta*.T16 

XSSC. 0. 1XSDS T?60t\ 1 tflHftfrU h5 v>*r5 7 4 -IC«fcoTf§|5] 
feZTZt^yi-foZtiifo Lfc*S&, Haematococcus pluvialis NlES-144T(ix Hi 
ndlll M#m5kb. lOkb, 1.9kb % PstI^jk^T(i6. lkb N 3.3kb . 2.8kb % 
2.3kb > 2.0kb , 1. 4kb , 0.8kb % XbalM#J7r5. lkb tDfiCfflc^n-t'tl^^ •> 
fi-JW&btltZo CtUCftLT. Haematococcus lacustris UTBX 294T(i. Hi 
ndlll ^^"C15kb> 7. 7kb . 1. 9kb % Pstl^MT'lOkb, 5.0kb % 4.0kb % 3. 
4kb % 2.9kb „ 1. 5kb > 0. 82kb. XbaIfNMTtt20kb£t±©D N AOtfcfif tilf K 
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J £tl*ftl%H,* 's7i-Jl'i)mz>tl, Haematococcus lacustris C-392 Tti. Hindi I 
I $ft$t-Crl5kb. 12kb. 1. 9kb , Ps 1 1 flHb40n? (*6. 5 kb. 3.0kb N 2.3kb > 2.0k 
b % 1.4kb , 0.8kb , XbaiM$IT'Ji5. 3 kb©&lic**i-e'ft9£l,> •> y+Jlim 

* ©fife© *r b&ZSti'r Vhuf-J 4 K©££/jftt6£tt 
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: 1 
: 3 2 0 

mnom .run 



: Haematococcus pluvial is 
: NIES-144 

Met His Val Ala Ser Ala Leu Met Val Glu Gin Lys Gly Ser Glu 

15 10 15 

Ala Ala Ala Ser Ser Pro Asp Val Leu Arg Ala Trp Ala Thr Gin 

20 25 30 

Tyr His Met Pro Ser Glu Ser Ser Asp Ala Ala Arg Pro Ala Leu 

35 40 45 

Lys His Ala Tyr Lys Pro Pro Ala Ser Asp Ala Lys Gly lie Thr 

50 55 60 

Met Ala Leu Thr He He Gly Thr Trp Thr Ala Val Phe Leu His 

65 70 75 

Ala He Phe Gin He Arg Leu Pro Thr Ser Met Asp Gin Leu His 

80 85 90 

Trp Leu Pro Val Ser Glu Ala Thr Ala Gin Leu Leu Gly Gly Ser 

95 100 105 
Ser Ser Leu Leu His lie Ala Ala Val Phe He Val Leu Glu Phe 

110 115 120 

Leu Tyr Thr Gly Leu Phe lie Thr Thr His Asp Ala Met His Gly 

125 130 135 
Thr lie Ala Leu Arg His Arg Gin Leu Asn Asp Leu Leu Gly Asn 
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140 145 150 

» 

lie Cys lie Ser Leu Tyr Ala Trp Phe Asp Tyr Ser Met Leu His 

155 160 165 
Arg Lys His Trp Glu His His Asn His Thr Gly Glu Val Gly Lys 

170 175 180 
Asp Pro Asp Phe His Lys Gly Asn Pro Gly Leu Val Pro Trp Phe 

185 190 195 
Ala Ser Phe Met Ser Ser Tyr Met Ser Leu Trp Gin Phe Ala Arg 

200 205 210 

Leu Ala Trp Trp Ala Val Val Met Gin Met Leu Gly Ala Pro Met 

215 220 225 

Ala Asn Leu Leu Val Phe Met Ala Ala Ala Pro He Leu Ser Ala 

230 235 240 

Phe Arg Leu Phe Tyr Phe Gly Thr Tyr Leu Pro His Lys Pro Glu 

245 250 255 

Pro Gly Pro Ala Ala Gly Ser Gin Val Met Ala Trp Phe Arg Ala 

260 265 270 

Lys Thr Ser Glu Ala Ser Asp Val Met Ser Phe Leu Thr Cys Tyr 

275 280 285 

His Phe Asp Leu His Trp Glu His His Arg Trp Pro Phe Ala Pro 

290 295 300 

Trp Trp Gin Leu Pro His Cys Arg Arg Leu Ser Gly Arg Gly Leu 

305 310 315 
Val Pro Ala Leu Ala 

320 



'm^ : 2 
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h # u *J - : jt$# 

: Haematococcus pluvial is 
: NIES-144 

Met Val Glu Gin Lys Gly Ser Glu Ala Ala Ala Ser Ser Pro Asp 

15 10 15 

Val Leu Arg Ala Trp Ala Thr Gin Tyr His Met Pro Ser Glu Ser 

20 25 30 

Ser Asp Ala Ala Arg Pro Ala Leu Lys His Ala Tyr Lys Pro Pro 

» 

35 40 45 

Ala Ser Asp Ala Lys Gly He Thr Met Ala Leu Thr He lie Gly 

50 55 60 

Thr Trp Thr Ala Val Phe Leu His Ala He Phe Gin He Arg Leu 

65 70 75 

Pro Thr Ser Met Asp Gin Leu His Trp Leu Pro Val Ser Glu Ala 

80 85 90 

Thr Ala Gin Leu Leu Gly Gly Ser Ser Ser Leu Leu His He Ala 

95 100 105 

Ala Val Phe He Val Leu Glu Phe Leu Tyr Thr Gly Leu Phe lie 

110 115 120 

Thr Thr His Asp Ala Met His Gly Thr He Ala Leu Arg His Arg 

125 130 135 

Gin Leu Asn Asp Leu Leu Gly Asn lie Cys lie Ser Leu Tyr Ala 

140 145 150 

Trp Phe Asp Tyr Ser Met Leu His Arg Lys His Trp Glu His His 

155 160 165 

Asn His Thr Gly Glu Val Gly Lys Asp Pro Asp Phe His Lys Gly 
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170 175 180 

Asn Pro Gly Leu Val Pro Trp Phe Ala Ser Phe Met Ser Ser Tyr 

185 190 195 

Met Ser Leu Trp Gin Phe Ala Arg Leu Ala Trp Trp Ala Val Val 

200 205 210 

Met Gin Met Leu Gly Ala Pro Met Ala Asn Leu Leu Val Phe Met 

215 220 225 

Ala Ala Ala Pro lie Leu Ser Ala Phe Arg Leu Phe Tyr Phe Gly 

230 235 240 

Thr Tyr Leu Pro His Lys Pro Glu Pro Gly Pro Ala Ala Gly Ser 

245 250 255 

Gin Val Met Ala Trp Phe Arg Ala Lys Thr Ser Glu Ala Ser Asp 

260 265 270 

Val Met Ser Phe Leu Thr Cys Tyr His Phe Asp Leu His Trp Glu 

275 280 285 

His His Arg Trp Pro Phe Ala Pro Trp Trp Gin Leu Pro His Cys 

290 295 300 

Arg Arg Leu Ser Gly Arg Gly Leu Val Pro Ala Leu Ala 

305 310 313 



&?|J#^: 3 
E?iJ©£$ : 2 88 

m&iom : 7 i j m 

£M%3 : Haematococcus pluvialis 
: NIES-144 
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Met Pro Ser Glu Ser 
1 5 
Ala Tyr Lys Pro Pro 

20 

Leu Thr lie He Gly 

35 

Phe Gin He Arg Leu 

50 

Pro Val Ser Glu Ala 

65 

Leu Leu His lie Ala 

80 

Thr Gly Leu Phe He 

95 

Ala Leu Arg His Arg 

110 

He Ser Leu Tyr Ala 

125 

His Trp Glu His His 

140 

Asp Phe His Lys Gly 

155 

Phe Met Ser Ser Tyr 

170 

Trp Trp Ala Val Val 

185 

Leu Leu Val Phe Met 

200 

Leu Phe Tyr Phe Gly 



Ser Asp Ala Ala Arg Pro 

10 

Ala Ser Asp Ala Lys Gly 

25 

Thr Trp Thr Ala Val Phe 

40 

Pro Thr Ser Met Asp Gin 

55 

Thr Ala Gin Leu Leu Gly 

70 

Ala Val Phe He Val Leu 

85 

Thr Thr His Asp Ala Met 

100 

Gin Leu Asn Asp Leu Leu 

115 

Trp Phe Asp Tyr Ser Met 

130 

Asn His Thr Gly Glu Val 

145 

Asn Pro Gly Leu Val Pro 

160 

Met Ser Leu Trp Gin Phe 

175 

Met Gin Met Leu Gly Ala 

190 

Ala Ala Ala Pro He Leu 

205 

Thr Tyr Leu Pro His Lys 



PCT/JP95/01640 

Ala Leu Lys His 

15 

He Thr Met Ala 

30 

Leu His Ala He 

45 

Leu His Trp Leu 

60 

Gly Ser Ser Ser 

75 

Glu Phe Leu Tyr 

90 

His Gly Thr He 

105 

Gly Asn He Cys 

120 

Leu His Arg Lys 

135 

Gly Lys Asp Pro 

150 

Trp Phe Ala Ser 

165 

Ala Arg Leu Ala 

180 

Pro Met Ala Asn 

195 

Ser Ala Phe Arg 

210 

Pro Glu Pro Gly 
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215 220 225 
Pro Ala Ala Cly Ser Gin Val Met Ala Trp Phe Arg Ala Lys Thr 

230 235 240 

Ser Glu Ala Ser Asp Val Met Ser Phe Leu Thr Cys Tyr His Phe 

245 250 255 

Asp Leu His Trp Glu His His Arg Trp Pro Phe Ala Pro Trp Trp 

260 265 270 

Gin Leu Pro His Cys Arg Arg Leu Ser Gly Arg Gly Leu Val Pro 

275 280 285 

Ala Leu Ala 
288 



K?'J#^- : 4 

ge?iJCDg£ : 1 6 7 7 
m<D& : 

mmomm c dna 

: Haema toco ecus pluvial is 
: NIES-144 

CGGGGCAACT CAAGAAATTC AACAGCTGCA AGCGCGCCCC AGCCTCACAG CGCCAAGTGA 60 
GCTATCGACG TGGTTGTGAG CGCTCGACGT GGTCCACTGA CGGGCCTGTG AGCCTCTGCG 120 
CTCCGTCCTC TGCCAAATCT CGCGTCGGGG CCTGCCTAAG TCGAAGAATG CAC GTC 176 

Met His Val 
1 

GCA TCG GCA CTA ATG GTC GAG CAG AAA GGC AGT GAG GCA GCT GCT TCC 224 
Ala Ser Ala Leu Met Val Glu Gin Lys Gly Ser Glu Ala Ala Ala Ser 
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5 10 15 

AGC CCA CAC GTC TTG AGA GCG TGG GCG ACA CAG TAT CAC ATG CCA TCC 272 
Ser Pro Asp Val Leu Arg Ala Trp Ala Thr Gin Tyr His Met Pro Ser 
20 25 30 35 

GAG TCG TCA GAC GCA GCT CGT CCT GCG CTA AAG CAC GCC TAC AAA CCT 320 
Glu Ser Ser Asp Ala Ala Arg Pro Ala Leu Lys His Ala Tyr Lys Pro 

40 45 50 

CCA GCA TCT GAC GCC AAG GGC ATC ACG ATG GCG CTG ACC ATC ATT GGC 368 
Pro Ala Ser Asp Ala Lys Gly He Thr Met Ala Leu Thr He lie Gly 

55 60 65 

ACC TGG ACC GCA GTG TTT TTA CAC GCA ATA TTT CAA ATC AGG CTA CCG 416 
Thr Trp Thr Ala Val Phe Leu His Ala He Phe Gin lie Arg Leu Pro 

70 75 80 

ACA TCC ATG GAC CAG CTT CAC TGG TTG CCT GTG TCC GAA GCC ACA GCC 464 
Thr Ser Met Asp Gin Leu His Trp Leu Pro Val Ser Glu Ala Thr Ala 

85 90 95 

CAG CTT TTG GGC GGA AGC AGC AGC CTA CTG CAC ATC GCT GCA GTC TTC 512 
Gin Leu Leu Gly Gly Ser Ser Ser Leu Leu His He Ala Ala Val Phe 
100 105 110 115 

ATT GTA CTT GAG TTC CTG TAC ACT GGT CTA TTC ATC ACC ACA CAT GAC 560 
He Val Leu Glu Phe Leu Tyr Thr Gly Leu Phe He Thr Thr His Asp 

120 125 130 

GCA ATG CAT GGC ACC ATA GCT TTG AGG CAC AGG CAG CTC AAT GAT CTC 608 
Ala Met His Gly Thr He Ala Leu Arg His Arg Gin Leu Asn Asp Leu 

135 140 145 

CTT GGC AAC ATC TGC ATA TCA CTG TAC GCC TGG TTT GAC TAC AGC ATG 656 
Leu Gly Asn He Cys He Ser Leu Tyr Ala Trp Phe Asp Tyr Ser Met 

150 155 160 

CTG CAT CGC AAG CAC TGG GAG CAC CAC AAC CAT ACT GGC GAA GTG GGG 704 

3 4 
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Leu His Arg Lys His Trp Glu His His Asn His Thr Giy Glu Val Gly 

165 170 175 

AAA GAC CCT GAC TTC CAC AAG GGA AAT CCC GGC CTT GTC CCC TGG TTC 752 
Lys Asp Pro Asp Phe His Lys Gly Asn Pro Gly Leu Vai Pro Trp Phe 
180 185 190 195 

GCC AGC TTC ATG TCC AGC TAC ATG TCC CTG TGG CAG TTT GCC CGG CTG 800 
Ala Ser Phe Met Ser Ser Tyr Met Ser Leu Trp Gin Phe Ala Arg Leu 

200 205 210 

GCA TGG TGG GCA GTG GTG ATG CAA ATG CTG GGG GCG CCC ATG GCA AAT 848 

p 

Ala Trp Trp Ala Val Val Met Gin Met Leu Gly Ala Pro Met Ala Asn 

215 220 225 

CTC CTA GTC TTC ATG GCT GCA GCC CCA ATC TTG TCA GCA TTC CGC CTC 896 
Leu Leu Val Phe Met Ala Ala Ala Pro He Leu Ser Ala Phe Arg Leu 

230 235 240 

TTC TAC TTC GGC ACT TAC CTG CCA CAC AAG CCT GAG CCA GGC CCT GCA 944 
Phe Tyr Phe Gly Thr Tyr Leu Pro His Lys Pro Glu Pro Gly Pro Ala 

245 250 255 

GCA GGC TCT CAG GTG ATG GCC TGG TTC AGG GCC AAG ACA ACT GAG GCA 992 
Ala Gly Ser Gin Val Met Ala Trp Phe Arg Ala Lys Thr Ser Glu Ala 
260 265 270 275 

TCT GAT GTG ATG AGT TTC CTG ACA TGC TAC CAC TTT GAC CTG CAC TGG 1040 
Ser Asp Val Met Ser Phe Leu Thr Cys Tyr His Phe Asp Leu His Trp 

280 285 290 

GAG CAC CAC AGG TGG CCC TTT GCC CCC TGG TGG CAG CTG CCC CAC TGC 1088 
Glu His His Arg Trp Pro Phe Ala Pro Trp Trp Gin Leu Pro His Cys 

295 300 305 

CGC CGC CTG TCC GGG CGT GGC CTG GTG CCT GCC TTG GCA TGACCTGGTC 1137 
Arg Arg Leu Ser Gly Arg Gly Leu Val Pro Ala Leu Ala 
310 315 320 
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CCTCCGCTGG TGACCCAGCG TCTGCACAAG 

GGCAGCGCAG TGCACTCTCA GCCTGTATGG 

TGCCACTGAG CACTGGGCGT GCTACTGAGC 

TACTGACAAT GGGCGTGCTA CTGGGGTCTG 

AGTAGCGGTG GCCAACGTCA TGTGGATGGT 

TTTGAGAGGG CTAAGTTATA AATCGCATGC 

GGGAAATCCC TTCGAGAGTG ATTATGGGAC 

TCAGCAGCAG TACTTAGTGA GGGTGAGAGC 

AACCTGGTCA GCGAGGTGAA CAGCCTGTAA 
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AGTGTCATGC TACAGGGTGC TGCGGCCAGT 1197 

GGCTACCGCT GTGCCACTGA GCACTGGGCA 1257 

AATGGGCGTG CTACTGAGCA ATGGGCGTGC 1317 

GCAGTGGCTA GGATGGAGTT TGATGCATTC 1377 

GGAAGTGCTG AGGGGTTTAG GCAGCCGGCA 1437 

TGCTCATGCG CACATATCTG CACACAGCCA 1497 

ACTTGTATTG GTTTCGTGCT ATTGTTTTAT 1557 

AGGGTGGTGA GAGTGGAGTG AGTGAGTATG 1617 

TGAATGACTC TGTCTAAAAA AAAAAAAAAA 1677 



Beyj#^- : 5 

K?iJ©g£ : 9 6 3 

: 

h # D V - : 

: c DNA 

l kM%* : Haematococcus pluvial is 
: NIES-144 

ATG CAC GTC GCA TCG GCA CTA ATG GTC GAG CAG AAA GGC ACT GAG 45 
Met His Val Ala Ser Ala Leu Met Val Glu Gin Lys Gly Ser Glu 
15 10 15 

GCA GCT GCT TCC AGC CCA GAC GTC TTG AGA GCG TGG GCG ACA CAG 90 
Ala Ala Ala Ser Ser Pro Asp Val Leu Arg Ala Trp Ala Thr Gin 

20 25 30 

TAT CAC ATG CCA TCC GAG TCG TCA GAC GCA GCT CGT CCT GCG CTA 135 
Tyr His Met Pro Ser Glu Ser Ser Asp Ala Ala Arg Pro Ala Leu 

35 40 45 



3 6 



WO 96/06172 PCT/JP95/01640 

AAG CAC GCC TAC AAA CCT CCA GCA TCT GAC GCC AAG GGC ATC ACG 180 
Lys His AJa Tyr Lys Pro Pro Ala Ser Asp Ala Lys Gly lie Thr 

50 55 60 

ATG GCG CTG ACC ATC ATT GGC ACC TGG ACC GCA GTG TTT TTA CAC 225 
Met Ala Leu Thr lie lie Gly Thr Trp Thr Ala Val Phe Leu His 

65 70 75 

GCA ATA TTT CAA ATC AGG CTA CCG ACA TCC ATG GAC CAG CTT CAC 270 
Ala He Phe Gin He Arg Leu Pro Thr Ser Met Asp Gin Leu His 

80 85 90 

TGG TTG CCT GTG TCC GAA GCC ACA GCC CAG CTT TTG GGC GGA AGC 315 
Trp Leu Pro Val Ser Glu Ala Thr Ala Gin Leu Leu Gly Gly Ser 

95 100 105 

AGC AGC CTA CTG CAC ATC GCT GCA GTC TTC ATT GTA CTT GAG TTC 360 
Ser Ser Leu Leu His He Ala Ala Val Phe He Val Leu Glu Phe 

110 115 120 

CTG TAC ACT GGT CTA TTC ATC ACC ACA CAT GAC GCA ATG CAT GGC 405 
Leu Tyr Thr Gly Leu Phe lie Thr Thr His Asp Ala Met His Gly 

125 130 135 

ACC ATA GCT TTG AGG CAC AGG CAG CTC AAT GAT CTC CTT GGC AAC 450 
Thr lie Ala Leu Arg His Arg Gin Leu Asn Asp Leu Leu Gly Asn 

140 145 150 

ATC TGC ATA TCA CTG TAC GCC TGG TTT GAC TAC AGC ATG CTG CAT 495 
He Cys He Ser Leu Tyr Ala Trp Phe Asp Tyr Ser Met Leu His 

155 160 165 

CGC AAG CAC TGG GAG CAC CAC AAC CAT ACT GGC GAA GTG GGG AAA 540 
Arg Lys His Trp Glu His His Asn His Thr Gly Giu Val Gly Lys 

170 175 180 

GAC CCT GAC TTC CAC AAG GGA AAT CCC GGC CTT GTC CCC TGG TTC 585 
Asp Pro Asp Phe His Lys Gly Asn Pro Gly Leu Val Pro Trp Phe 
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185 190 195 

GCC AGC TTC ATG TCC AGC TAC ATG TCC CTG TGG CAG TTT GCC CGG 630 
Ala Scr Phe Met Ser Ser Tyr Met Ser Leu Trp Gin Phe Ala Arg 

200 205 210 

CTG GCA TGG TGG GCA GTG GTG ATG CAA ATG CTG GGG GCG CCC ATG 675 
Leu Ala Trp Trp Ala Val Val Met Gin Met Leu Gly Ala Pro Met 

215 220 225 

GCA AAT CTC CTA GTC TTC ATG GCT GCA GCC CCA ATC TTG TCA GCA 720 
Ala Asn Leu Leu Val Phe Met Ala Ala Ala Pro lie Leu Ser Ala 

230 235 240 

TTC CGC CTC TTC TAC TTC GGC ACT TAC CTG CCA CAC AAG CCT GAG 765 
Phe Arg Leu Phe Tyr Phe Gly Thr Tyr Leu Pro His Lys Pro Glu 

245 250 255 

CCA GGC CCT GCA GCA GGC TCT CAG GTG ATG GCC TGG TTC AGG GCC 810 
Pro Gly Pro Ala Ala Gly Ser Gin Val Met Ala Trp Phe Arg Ala 

260 265 270 

AAG ACA ACT GAG GCA TCT GAT GTG ATG AGT TTC CTG ACA TGC TAC 855 
Lys Thr Ser Glu Ala Ser Asp Val Met Ser Phe Leu Thr Cys Tyr 

275 280 285 

CAC TTT GAC CTG CAC TGG GAG CAC CAC AGG TGG CCC TTT GCC CCC 900 
His Phe Asp Leu His Trp Glu His His Arg Trp Pro Phe Ala Pro 

290 295 300 

TGG TGG CAG CTG CCC CAC TGC CGC CGC CTG TCC GGG CGT GGC CTG 945 

■ 

Trp Trp Gin Leu Pro His Cys Arg Arg Leu Ser Gly Arg Gly Leu 

■ 

305 310 315 

GTG CCT GCC TTG GCA TGA 963 
Val Pro Ala Leu Ala 

320 
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: 6 

K7>J©££ : 9 4 2 

&m<DW. : mm 

h$Vit- : 
afi^lJOSJ® : c DNA 

: Haematococcus pluvial is 
: NIBS-144 

ATG GTC GAG CAG AAA GGC AGT GAG GCA GCT GCT TCC AGC CCA GAC 45 
Met Val Glu Gin Lys Gly Ser Glu Ala Ala Ala Ser Ser Pro Asp 
1 5 10 15 

GTC TTG AGA GCG TGG GCG ACA CAG TAT CAC ATG CCA TCC GAG TCG 90 
Val Leu Arg Ala Trp Ala Thr Gin Tyr His Met Pro Ser Glu Ser 

20 25 30 

TCA GAC GCA GCT CGT CCT GCG CTA AAG CAC GCC TAC AAA CCT CCA 135 
Ser Asp Ala Ala Arg Pro Ala Leu Lys His Ala Tyr Lys Pro Pro 

35 40 45 

GCA TCT GAC GCC AAG GGC ATC ACG ATG GCG CTG ACC ATC ATT GGC 180 
Ala Ser Asp Ala Lys Gly He Thr Met Ala Leu Thr lie lie Gly 

50 55 60 

ACC TGG ACC GCA GTG TTT TTA CAC GCA ATA TTT CAA ATC AGG CTA 225 
Thr Trp Thr Ala Val Phe Leu His Ala He Plie Gin He Arg Leu 

65 70 75 

CCG ACA TCC ATG GAC CAG CTT CAC TGG TTG CCT GTG TCC GAA GCC 270 
Pro Thr Ser Met Asp Gin Leu His Trp Leu Pro Val Ser Glu Ala 

80 85 90 

ACA GCC CAG CTT TTG GGC GGA AGC AGC AGC CTA CTG CAC ATC GCT 315 
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Thr Ala Gin Leu Leu Gly Gly Ser Ser Ser Leu Leu His lie Ala 

95 100 105 

GCA GTC TTC ATT GTA CTT GAG TTC CTG TAC ACT GGT CTA TTC ATC 360 
Ala Val Phe lie Val Leu Glu Phe Leu Tyr Thr Gly Leu Phe He 

110 115 120 

ACC ACA CAT GAC GCA ATG CAT GGC ACC ATA GCT TTG AGG CAC AGG 405 
Thr Thr His Asp Ala Met His Gly Thr He Ala Leu Arg His Arg 

125 130 135 

CAG CTC AAT GAT CTC CTT GGC AAC ATC TGC ATA TCA CTG TAC GCC 450 
Gin Leu Asn Asp Leu Leu Gly Asn He Cys lie Ser Leu Tyr Ala 

140 145 150 

TGG TTT GAC TAC AGC ATG CTG CAT CGC AAG CAC TGG GAG CAC CAC 495 
Trp Phe Asp Tyr Ser Met Leu His Arg Lys His Trp Glu His His 

155 160 165 

AAC CAT ACT GGC GAA GTG GGG AAA GAC CCT GAC TTC CAC AAG GGA 540 
Asn His Thr Gly Glu Val Gly Lys Asp Pro Asp Phe His Lys Gly 

170 175 180 

AAT CCC GGC CTT GTC CCC TGG TTC GCC AGC TTC ATG TCC AGC TAC 585 
Asn Pro Gly Leu Val Pro Trp Phe Ala Ser Phe Met Ser Ser Tyr 

185 190 195 

ATG TCC CTG TGG CAG TTT GCC CGG CTG GCA TGG TGG GCA GTG GTG 630 
Met Ser Leu Trp Gin Phe Ala Arg Leu Ala Trp Trp Ala Val Val 

200 205 210 

ATG CAA ATG CTG GGG GCG CCC ATG GCA AAT CTC CTA GTC TTC ATG 675 
Met Gin Met Leu Gly Ala Pro Met Ala Asn Leu Leu Val Phe Met 

215 220 225 

GCT GCA GCC CCA ATC TTG TCA GCA TTC CGC CTC TTC TAC TTC GGC 720 
Ala Ala Ala Pro He Leu Ser Ala Phe Arg Leu Phe Tyr Phe Gly 

230 235 240 
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ACT TAC CTG CCA CAC AAG CCT GAG CCA GGC CCT GCA GCA GGC TCT 765 
Thr Tyr Leu Pro His Lys Pro Glu Pro Gly Pro Ala Ala Gly Ser 

245 250 255 

CAG GTG ATG GCC TGG TTC AGG GCC AAG ACA AGT GAG GCA TCT GAT 810 
Gin Val Met Ala Trp Phe Arg Ala Lys Thr Ser Glu Ala Ser Asp 

260 265 270 

GTG ATG AGT TTC CTG ACA TGC TAC CAC TTT GAC CTG CAC TGG GAG 855 
Val Met Ser Phe Leu Thr Cys Tyr His Phe Asp Leu His Trp Glu 

275 280 285 

CAC CAC AGG TGG CCC TTT GCC CCC TGG TGG CAG CTG CCC CAC TGC 900 
His His Arg Trp Pro Phe Ala Pro Trp Trp Gin Leu Pro His Cys 

290 295 300 

CGC CGC CTG TCC GGG CGT GGC CTG GTG CCT GCC TTG GCA TGA 942 
Arg Arg Leu Ser Gly Arg Gly Leu Val Pro Ala Leu Ala 

305 310 313 

B2?iJ#-f-: 7 
mPlO&Z : 8 6 7 
K?lJ©S! : 

zmonm : cdna 

QeM^ : Haematococcus pluvial is 
: N1ES-144 

ATG CCA TCC GAG TCG TCA GAC GCA GCT CGT CCT GCG CTA AAG CAC 45 
Met Pro Ser Glu Ser Ser Asp Ala Ala Arg Pro Ala Leu Lys His 
15 10 15 
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GCC TAC AAA CCT CCA GCA TCT GAC GCC AAG GGC ATC ACG ATG GCG 90 

Ala Tyr Lys Pro Pro Ala Ser Asp Ala Lys Gly lie Thr Met Ala 

20 25 30 

CTG ACC ATC ATT GGC ACC TGG ACC GCA GTG TTT TTA CAC GCA ATA 135 

Leu Thr lie lie Gly Thr Trp Thr Ala Val Phe Leu His Ala lie 

35 40 45 

TTT CAA ATC AGG CTA CCG ACA TCC ATG GAC CAG CTT CAC TGG TTG 180 

Phe Gin lie Arg Leu Pro Thr Ser Met Asp Gin Leu His Trp Leu 

50 55 60 

CCT GTG TCC GAA GCC ACA GCC CAG CTT TTG GGC GGA AGC AGC AGC 225 

Pro Val Ser Glu Ala Thr Ala Gin Leu Leu Gly Gly Ser Ser Ser 

65 70 75 

CTA CTG CAC ATC GCT GCA GTC TTC ATT GTA CTT GAG TTC CTG TAC 270 

Leu Leu His He Ala Ala Val Phe lie Val Leu Glu Phe Leu Tyr 

80 85 90 

ACT GGT CTA TTC ATC ACC ACA CAT GAC GCA ATG CAT GGC ACC ATA 315 

Thr Gly Leu Phe lie Thr Thr His Asp Ala Met His Gly Thr He 

95 100 105 

GCT TTG AGG CAC AGG CAG CTC AAT GAT CTC CTT GGC AAC ATC TGC 360 

Ala Leu Arg His Arg Gin Leu Asn Asp Leu Leu Gly Asn lie Cys 

110 115 120 

ATA TCA CTG TAC GCC TGG TTT GAC TAC AGC ATG CTG CAT CGC AAG 405 

lie Ser Leu Tyr Ala Trp Phe Asp Tyr Ser Met Leu His Arg Lys 

125 130 135 

CAC TGG GAG CAC CAC AAC CAT ACT GGC GAA GTG GGG AAA GAC CCT 450 

His Trp Glu His His Asn His Thr Gly Glu Val Gly Lys Asp Pro 

140 145 150 

GAC TTC CAC AAG GGA AAT CCC GGC CTT GTC CCC TGG TTC GCC AGC 495 

Asp Phe His Lys Gly Asn Pro Gly Leu Val Pro Trp Phe Ala Ser 
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155 

TTC ATG TCC AGC TAC ATC TCC CTG TGG 
Phe Met Ser Ser Tyr Met Ser Leu Trp 

170 

TGG TGG GCA GTG GTG ATG CAA ATG CTG 
Trp Trp Ala Val Val Met Gin Met Leu 

185 

CTC CTA GTC TTC ATG GCT GCA GCC CCA 
Leu Leu Val Phe Met Ala Ala Ala Pro 

200 

CTC TTC TAC TTC GGC ACT TAC CTG CCA 
Leu Phe Tyr Phe Gly Thr Tyr Leu Pro 

215 

CCT GCA GCA GGC TCT CAG GTG ATG GCC 
Pro Ala Ala Gly Ser Gin Val Met Ala 

230 

ACT GAG GCA TCT GAT GTG ATG AGT TTC 
Ser Glu Ala Ser Asp Val Met Ser Phe 

245 

GAC CTG CAC TGG GAG CAC CAC AGG TGG 
Asp Leu His Trp Glu His His Arg Trp 

260 

CAG CTG CCC CAC TGC CGC CGC CTG TCC 
Gin Leu Pro His Cys Arg Arg Leu Ser 

275 

GCC TTG GCA TGA 867 
Ala Leu Ala 
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160 


165 




CAG TTT GCC 


CGG CTG GCA 


r ja 

540 


Gin Phe Ala 


Arg Leu Ala 




175 


1 OA 

loU 




rrr rrr rrr 
000 000 000 


ATP fTA AAT 
A 10 OOA AA1 


Oou 


uly Ala rrO 


Mec Aia Asn 




190 






ATC TTG TCA 


ppa ttp rrr 
OOA 1 1 0 000 


con 


lie Leu Ser 


Ala Phe Arg 




205 


210 




CAC AAG CCT 


PAP PPA PPP 

OAO OOA 000 


0(0 


His Lys rro 


oiu rro oiy 




OOA 






tpp ttp app 

too rro aoo 


rrr aap apa 
000 AAO AOA 


79A 
f L\J 


Trp Phe Arg 


Ala Lys Thr 




235 


240 




CTG ACA TGC 


TAC CAC TTT 


765 


Leu Thr Cys 


Tyr His Phe 




250 


255 




TTT PPP 

CCC TTT OCC 


****** xor* T/*r 
CtL Ibb loU 


Q1 A 

oil) 


Pro Phe Ala 


Pro Trp Trp 




265 


270 




GGG CGT GGC 


CTG GTG CCT 


855 


Gly Arg Gly 


Leu Val Pro 




280 


285 
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A 

T 176 

ATC CAC CTC CCA 

Met His Val Ala 
230 

TCC AGC CCA GAC 
Ser Ser Pro Asp 
284 

TCG TCA GAC GCA 
Ser Ser Asp Ala 
338 

GAC GCC AAG GGC 
Asp Ala Lys Gly 
392 

TTT TTA CAC GCA 
Phe Leu His Ala 
446 

TGG TTG CCT GTG 
Trp Leu Pro Val 
500 

CTG CAC ATC GCT 
Leu His lie Ala 
554 

ATC ACC ACA CAT 
lie Thr Thr His 
608 

AAT GAT CTC CTT 
Asn Asp Leu Leu 
662 

ATG CTG CAT CGC 
Met Leu His Arg 
716 

GAC CCT GAC TTC 
Asp Pro Asp Phe 
770 

ATG TCC AGC TAC 
Met Ser Ser Tyr 
824 

GTG ATG CAA ATG 
Val Met Gin Met 
878 

GCC CCA ATC TTG 
Ala Pro He Leu 
932 

AAG CCT GAG CCA 
Lys Pro Glu Pro 
986 

AAG ACA AGT GAG 
Lys Thr Ser Glu 

1040 
CTG CAC TGG GAG 
Leu His Trp Glu 

1094 

TGC CGC CGC CTG 
Cys Arg Arg Leu 



18S 
TCG GCA CTA 
Ser Ala Leu 

239 
GTC TTG AGA 
Val Leu Arg 

293 
CCT CGT CCT 
Ala Arg Pro 

347 

ATC ACG ATG 
He Thr Met 

401 
ATA TTT CAA 

He Phe Gin 

455 
TCC GAA GCC 
Ser Glu Ala 

509 
GCA GTC TTC 
Ala Val Phe 

563 
GAC GCA ATG 
Asp Ala Met 

617 
GGC AAC ATC 
Gly Asn He 

671 

AAG CAC TGG 
Lys His Trp 

725 
CAC AAG GGA 
His Lys Gly 

779 
ATG TCC CTG 
Met Ser Leu 

833 
CTG GGG GCG 
Leu Gly Ala 

887 
TCA GCA TTC 
Ser Ala Phe 

941 

GGC CCT GCA 
Gly Pro Ala 

995 
GCA TCT GAT 
Ala Ser Asp 

1049 
CAC CAC AGG 
His His Arg 

1103 
TCC GGG CGT 
Ser Gly Arg 



194 
ATG GTC 
Met Val 

248 

GCG 
Ala 



TGG 
Trp 
302 
CTA 
Leu 
356 
CTG 
Leu 
410 
AGG 
Arg 
464 
GCC 
Ala 
518 
GTA 
Val 
572 
GGC 
Gly 
626 
ATA 

He 
680 
CAC 
His 
734 
CCC 
Pro 
788 
CAG 
Gin 
842 
ATG 

Met 
896 
CTC 

Leu 
950 
GGC 
Gly 
1004 
GTG ATG 
Val Met 
1058 
TGG CCC 
Trp Pro 
1112 
GGC CTG 
Gly Leu 



GCG 
Ala 

GCG 
Ala 

ATC 

He 

ACA 
Thr 

ATT 
He 

CAT 
His 

TGC 
Cys 

GAG 
Glu 

AAT 
Asn 

TGG 
Trp 

CCC 
Pro 

CGC 
Arg 

GCA 
Ala 



GAG 
Glu 

GCG 
Ala 

AAG 
Lys 

ACC 
Thr 

CTA 
Leu 

CAG 
Gin 

CTT 
Leu 

ACC 
Thr 

TCA 
Ser 

CAC 
His 

GGC 
Gly 

TTT 
Phe 

GCA 
Ala 

TTC 
Phe 

TCT 

Ser 

AGT 
Ser 

TTT 
Phe 

GTG 
val 



203 
CAG AAA GGC 
Gin Lys Gly 

257 

ACA CAG TAT 
Thr Gin Tyr 

311 
CAC GCC 
His Ala 

36S 
ATC ATT GGC 
He He Gly 

419 
CCG ACA TCC 
Pro Thr Ser 

473 
CTT TTG 
Leu Leu 

527 

GAG TTC CTG 
Glu Phe Leu 
581 

ATA GCT TTG 
He Ala Leu 

635 
CTG TAC GCC 
Leu Tyr Ala 

689 
AAC CAT ACT 
Asn His Thr 

743 
CTT GTC CCC 
Leu Val Pro 

797 

GCC CGG CTG 
Ala Arg Leu 

851 
AAT CTC CTA 
Asn Leu Leu 

905 
TAC TTC GGC 
Tyr Phe Gly 

959 
CAG GTG ATG 
Gin Val Met 

1013 
TTC CTG ACA 
Phe Leu Thr 

1067 
GCC CCC TGG 
Ala Pro Trp 

1121 
CCT GCC TTG 
Pro Ala Leu 



TAC AAA 
Tyr Lys 



GGC GGA 
Gly Gly 



212 
AGT GAG GCA 
Ser Glu Ala 

266 
CAC ATG CCA 
His Met Pro 

320 

CCT CCA 
Pro Pro 
374 

ACC TGG ACC 
Thr Trp Thr 
426 

ATG GAC CAG 
Met Asp Gin 
482 

AGC AGC 
Ser Ser 
536 
TAC ACT GGT 
Tyr Thr Gly 
590 

AGG CAC AGG 
Arg His Arg 
644 

TGG TTT GAC 
Trp Phe Asp 
698 

GGC GAA GTG 
Gly Glu Val 

752 
TGG TTC GCC 
Trp Phe Ala 

806 
GCA TGG TGG 
Ala Tro Trp 

860 
GTC TTC ATG 

Val Phe Met 
914 

ACT TAC CTG 
Thr Tyr Leu 
968 

GCC TGG TTC 
Ala Trp Phe 

1022 
TGC TAC CAC 
Cys Tyr Kis 

1076 
TGG CAG CTG 
Trp Gin Leu 

1130 
GCA TCA 
Ala 



221 
GCT 
Ala 
275 
GAG 
Glu 
329 
TCT 
Ser 
383 
GTG 
Val 
437 
CAC 
His 
491 
CTA 
Leu 
545 
TTC 
Phe 
599 
CTC 
Leu 
653 
AGC 
Ser 
707 
AAA 
Lys 
761 
TTC 
Phe 
815 
GTG 
Val 
869 
GCA 
Ala 
923 
CAC 
His 
977 

GCC 
Ala 
1031 
TTT GAC 
Phe Aso 
1085 
CCC CAC 
Pro His 



GCT 
Ala 

TCC 
Ser 

GCA 
Ala 

GCA 
Ala 

CTT 
Leu 

AGC 
Ser 

CTA 
Leu 

CAG 
Gin 

TAC 
Tyr 

GGG 
Gly 

AGC 
Ser 

GCA 
Ala 

GCT 
Ala 

CCA 
Pro 

AGG 
Arg 
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B 

T 197 206 215 

ATG CTC GAG CAG AAA GGC AGT GAG GCA 
Met Val Glu Gin Lya Gly Ser Glu Ala 

251 260 269 

GCG TGG GCG ACA CAG TAT CAC ATG CCA 
Ala Trp Ala Thr Gin Tyr His Met Pro 

305 314 323 

GCG CTA AAG CAC GCC TAC AAA CCT CCA 
Ala Leu Lys His Ala Tyr Lys Pro Pro 

359 368 " 377 

GCG CTG ACC ATC ATT GGC ACC TGG ACC 
Ala Leu Thr lie .lie Gly Thr Trp Thr 

413 422 431 

ATC AGG CTA CCG ACA TCC ATG GAC CAG 
lie Arg Leu Pro Thr Ser Met Asp Gin 

467 476 485 

ACA GCC CAG CTT TTG GGC GGA AGC AGC 
Thr Ala Gin Leu Leu Gly Gly Ser Ser 

521 530 " 539 

ATT GTA CTT GAG TTC CTG TAC ACT GGT 
He Val Leu Glu Phe Leu Tyr Thr Gly 

575 584 593 

CAT GGC ACC ATA GCT TTG AGG CAC AGG 
His Gly Thr He Ala Leu Arg His Arg 

629 638 647 

TGC ATA TCA CTG TAC GCC TGG TTT GAC 
Cys He Ser Leu Tyr Ala Trp Phe Asp 

683 692 701 

GAG CAC CAC AAC CAT ACT GGC GAA GTG 
Glu His His Asn His Thr Gly Glu Val 

737 746 755 

AAT CCC GGC CTT GTC CCC TGG TTC GCC 
Asn Pro Gly Leu Val Pro Trp Phe Ala 

791 800 809 

TGG CAG TTT GCC CGG CTG GCA TGG TGG 
Trp Gin Phe Ala Arg Leu Ala Trp Tro 

845 854 863 

CCC ATG GCA AAT CTC CTA GTC TTC ATG 
Pro Met Ala Asn Leu Leu Val Phe Met 

899 908 917 

CGC CTC TTC TAC TTC GGC ACT TAC CTG 
Arg Leu Phe Tyr Phe Gly Thr Tyr Leu 

953 962 971 

GCA GGC TCT CAG GTG ATG GCC TGG TTC 
Ala Gly Ser Gin Val Met Ala Trp Phe 
1007 1016 1025 

GTG ATG AGT TTC CTG ACA TGC TAC CAC 
Val Met Ser Phe Leu Thr Cys Tyr His 
1061 1070 1079 

TGG CCC TTT GCC CCC TGG TGG CAG CTG 
Trp Pro Phe Ala Pro Trp Trp Gin Leu 

HIS 1124 
GGC CTG GTG CCT GCC TTG GCA TGA 
Gly Leu Val Pro Ala Leu Ala 

l 

D 
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224 






233 






242 


GCT 


GCT 


TCC 


AGC 


CCA 


GAC 


GTC 


TTG 


AGA 


Ala 


Ala 


Ser 


Ser 


Pro 


Asp 


Val 


Leu Arg 






278 






287 






296 


TCC 


GAG 


TCG 


TCA 


GAC 


GCA 


GCT 


CGT 


CCT 


Ser 


Glu 


Ser 


Ser 


Asp 


Ala 


Ala 


Arg Pro 






332 






341 






350 


GCA 


TCT 


CAC 


GCC 


AAG 


GGC 


ATC 


ACG 


ATG 


Ala 


Ser 


Asp 


Ala 


Lys Gly 


He 


Thr 


Met 






386 






395 






404 


GCA 


GTG 


TTT 


TTA 


CAC 


GCA 


ATA 


TTT 


CAA 


Ala 


Val 


Phe 


Leu 


His 


Ala 


He 


Phe 


Gin 






440 






449 






458 


CTT 


CAC 


TGG 


TTG 


CCT 


GTG 


TCC 


GAA 


GCC 


Leu 


His 


Trp 


Leu 


Pro 


Val 


Ser 


Glu 


Ala 






494 






503 






512 


AGC 


CTA 


CTG 


CAC 


ATC 


GCT 


GCA 


GTC 


TTC 


Ser 


Leu 


Leu 


His 


He 


Ala 


Ala 


Val 


Phe 


• 




548 






557 






566 


CTA 


TTC 


ATC 


ACC 


ACA 


CAT 


GAC 


GCA 


ATG 


Leu 


Phe 


He 


Thr 


Thr 


His 


Asp 


Ala 


Met 






602 






611 




620 


CAG 


CTC 


AAT 


GAT 


CTC 


CTT 


GGC 


AAC 


ATC 


Gin 


Leu 


Asn 


Asp 


Leu 


Leu 


Gly 


Asn 


He 






656 




665 




674 


TAC 


AGC 


ATG 


CTG 


CAT 


CGC 


AAG 


CAC 


TGG 


Tyr 


Ser 


Met 


Leu 


His 


Arg Lys 


His 


Trp 






710 






719 






728 


GGG 


AAA 


GAC 


CCT 


GAC 


TTC 


CAC 


AAG 


GGA 


Gly 


Lys 


Asp 


Pro 


Asp 


Phe 


His 


Lys 


Gly 






764 




773 






782 


AGC 


TTC 


ATG 


TCC 


AGC 


TAC 


ATG 


TCC 


CTG 


Ser 


Phe 


Met 


Ser 


Ser 


Tyr Met 


Ser 


Leu 






818 






827 






836 


GCA 


GTG 


GTG 


ATG 


CAA 


ATG 


CTG 


GGG 


GCG 


Ala 


val 


Val 


Met 


Gin 


Met 


Leu 


Gly Ala 






872 






881 






890 


GCT 


GCA 


GCC 


CCA 


ATC 


TTG 


TCA 


GCA 


TTC 


Ala 


Ala 


Ala 


Pro 


He 


Leu 


Ser 


Ala 


Phe 






926 






935 






944 


CCA 


CAC 


AAG 


CCT 


GAG 


CCA 


GGC 


CCT 


GCA 


Pro 


His 


Lys 


Pro 


Glu 


Pro Gly 


Pro 


Ala 






980 






989 






998 


AGG 


GCC 


AAG 


ACA 


AGT 


GAG 


GCA 


TCT 


GAT 


Arg Ala 


Lys 


Thr 


Ser 


Glu 


Ala 


Ser 


Asp 




1034 




1043 




1052 


TTT 


GAC 


CTG 


CAC 


TGG 


GAG 


CAC 


CAC 


AGG 


Phe 


Asp 


Leu 


His 


Trp 

1 


Glu 


His 


His 


Arg 




1088 




.097 




1106 


CCC 


CAC 


TGC 


CGC 


CGC 


CTG 


TCC 


GGG 


CGT 


Pro 


His 


Cys 


Arg 


Arg 


Leu 


Ser 


Gly Arg 
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c 

I 272 281 290 299 308 317 

ATG CCA TCC GAG TCG TCA GAC GCA GCT CCT CCT GCG CTA AAG CAC GCC TAC AAA 

Met Pro Ser Glu Ser Ser Asp Ala Ala Arg Pro Ala Leu Lys His Ala Tyr Lys 

326 335 344 353 362 371 

CCT CCA GCA TCT GAC GCC AAG GGC ATC ACG ATG GCG CTG ACC ATC ATT GGC ACC 

Pro Pro Ala Ser Asp Ala Lys Cly He Thr Met Ala Leu Thr He He Gly Thr 

380 3 *9 398 407 416 425 

TGG ACC GCA GTG TTT TTA CAC GCA ATA TTT CAA ATC AGG CTA CCG ACA TCC ATG 

Trp Thr Ala Val Phe Leu His Ala He Phe Gin He Arg Leu Pro Thr Ser Met 

434 443 452 461 470 479 

GAC CAG CTT CAC TGG TTG CCT GTG TCC CAA GCC ACA GCC CAG CTT TTG GGC GGA 

Asp Gin Leu His Trp Leu Pro Val Ser Glu Ala Thr Ala Gin Leu Leu Gly Gly 

488 497 506 515 524 533 

c GC r CTA ? TG St C ATC GCT GCA GTC TTC ATT GTA CT T GAG TTC CTG TAC 
Ser Ser Ser Leu Leu His He Ala Ala Val Phe He Val Leu Glu Phe Leu Tyr 

542 551 560 569 578 587 

ACT GGT CTA TTC ATC ACC ACA CAT GAC GCA ATG CAT GGC ACC ATA GCT TTG AGG 
Thr Gly Leu Phe He Thr Thr His Asp Ala Met His Gly Thr He Ala Leu Arg 

59 605 614 623 632 641 

CAC AGG CAG CTC AAT GAT CTC CTT GGC AAC ATC TGC ATA TCA CTG TAC GCC TGG 
His Arg Gin Leu Asn Asp Leu Leu Gly Asn He Cys He Ser Leu Tyr Ala Trp 

650 *59 668 677 686 695 

TTT GAC TAC AGC ATG CTG CAT CGC AAG CAC TGG GAG CAC CAC AAC CAT ACT GGC 
Phe Asp Tyr Ser Met Leu His Arg Lys His Trp Glu His His Asn His Thr Gly 

704 713 722 731 740 749 

GAA GTG GGG AAA GAC CCT GAC TTC CAC AAG GGA AAT CCC GGC CTT GTC CCC TGG 
Glu Val Gly Lys Asp Pro Asp Phe His Lys Gly Asn Pro Gly Leu Val Pro Trp 

758 767 776 785 794 803 

TTC GCC AGC TTC ATG TCC AGC TAC ATG TCC CTG TGG CAG TTT GCC CGG CTG GCA 
Phe Ala ser Phe Met Ser Ser Tyr Met Ser Leu Trp Gin Phe Ala Arg Leu Ala 

812 821 830 839 848 857 

l°t ??* w T ? S T ? ATG CAA ATG CTG GGG GCG CCC ATG GCA AAT CTC CTA GTC 
Trp Trp Ala Val Val Met Gin Met Leu Gly Ala Pro Met Ala Asn Leu Leu Val 

Bb& 87 5 884 893 902 911 

III $1+ ?? A CCA ATC TTG TCA GCA TTC CGC "C TTC TAC TTC GGC ACT 

Phe Met Ala Ala Ala Pro He Leu Ser Ala Phe Arg Leu Phe Tyr Phe Gly Thr 
920 929 938 947 955 955 

Itr rl?* o CA u? C ^ G CCT GAG CCA GGC CCT GCA GCA GGG TCT CAG GTG ATG GCC 
Tyr Leu Pro His Lys Pro Glu Pro Gly Pro Ala Ala Gly Ser Gin Val Met Ala 

974 983 992 1001 1010 1019 

l G ° 11° ^ GG AAG ACA AGT GAG GCA TCT GAT GTG ATG AGT TTC CTG ACA TGC 

Trp Phe Arg Ala Lys Thr Ser Glu Ala Ser Asp Val Met Ser Phe Leu Thr Cys 
1028 1037 1046 1055 1064 1073 

l* C £° ll T GAC CTG CAC TGG GAG CAC CAC AGG TGG CCC TTT GCC CCC TGG TGG 
Tyr His Phe Asp Leu His Trp Glu His His Arg Trp Pro Phe Ala Pro Trp Tr? 

1082 1091 1100 ' 1109 1118 1127 

r?S r TG o CC u AC I GC CGC CGC CTG TCC GGG CGT GGC CTG GTG CCT GCC TTG GCA 
Gin Leu Pro His Cys Arg Arg Leu Ser Gly Arg Gly Leu Val Pro Ala Leu Ala 

TGA A 

D 

* * » 
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30 60 
CGGGGCAACT CAAGAAATTC AACAGCTGCA AGCGCGCCCC AGCCTCACAG CGCCAAGTGA 
GCCCCGTTGA GTTCTTTAAG TTGTCGACGT TCGCGCGGGG TCGGAGTGTC GCGGTTCACT 

90 120 
GCTATCGACG TGGTTGTGAG CGCTCGACGT GGTCCACTGA CGGGCCTGTG AGCCTCTGCG 
CGATAGCTGC ACCAACACTC GCGAGCTGCA CCAGGTGACT GCCCGGACAC TCGGAGACGC 

150 T 180 

CTCCGTCCTC TGCCAAATCT CGCGTCGGGG CCTGCCTAAG TCGAAGAATG CACGTCGCAT 
GAGGCAGGAG ACGGTTTAGA GCGCAGCCCC GGACGGATTC AGCTTCTTAC GTGCAGCGTA 

J 210 240 
T GGTCGAGCAG AAAGGCAGTG AGGCAGCTGC TTCCAGCCCA GACGTCTTGA 
GCCGTGATTA CCAGCTCGTC TTTCCGTCAC TCCGTCGACG AAGGTCGGGT CTGCAGAACT 

c 

t 270 300 

GAGCGTGGGC G AC AC AG TAT CACATGCCAT CCGAGTCGTC AGACGCAGCT CGTCCTGCGC 
CTCGCACCCG CTGTGTCATA GTGTACGGTA GGCTCAGCAG TCTGCGTCGA GCAGGACGCG 

330 360 
TAAAGCACGC CTACAAACCT CCAGCATCTG ACGCCAAGGG CATCACGATG GCGCTGACCA 
ATTTCGTGCG GATGTTTGGA GGTCGTAGAC TGCGGTTCCC GTAGTGCTAC CGCGACTGGT 

390 420 
TCATTGGCAC CTGGACCGCA GTGTTTTTAC ACGCAATATT TCAAATCAGG CTACCGACAT 
AGTAACCGTG GACCTGGCGT CACAAAAATG TGCGTTATAA AGTTTAGTCC GATGGCTGTA 

450 480 
CCATGGACCA GCTTCACTGG TTGCCTGTGT CCGAAGCCAC AGCCCAGCTT TTGGGCGGAA 
GGTACCTGGT CGAAGTGACC AACGGACACA GGCTTCGGTG TCGGGTCGAA AACCCGCCTT 

510 540 
GCAGCAGCCT ACTGCACATC GCTGCAGTCT TCATTGTACT TGAGTTCCTG TACACTGGTC 
CGTCGTCGGA TGACGTGTAG CGACGTCAGA AGTAACATGA ACTCAAGGAC ATGTGACCAG 

570 600 
TATTCATCAC CACACATGAC GCAATGCATG GCACCATAGC TTTGAGGCAC AGGCAGCTCA 
ATAAGTAGTG GTGTGTACTG CGTTACGTAC CGTGGTATCG AAACTCCGTG TCCGTCGAGT 

630 680 
ATGATCTCCT TGGCAACATC TGCATATCAC TGTACGCCTG GTTTGACTAC AGCATGCTGC 
TACTAGAGGA ACCGTTGTAG ACGTATAGTG ACATGCGGAC CAAACTGATG TCGTACGACG 

690 12 i 
ATCGCAAGCA CTGGGAGCAC CACAACCATA CTGGCGAAGT GGGGAAAGAC CCTGACTTCC 
TAGCGTTCGT GACCCTCGTG GTGTTGGTAT GACCGCTTCA CCCCTTTCTG GGACTGAAGG 

750 780 
ACAAGGGAAA TCCCGGCCTT GTCCCCTGGT TCGCCAGCTT CATGTCCAGC TACATGTCCC 
TGTTCCCTTT AGGGCCGGAA CAGGGGACCA AGCGGTCGAA GTACAGGTCG ATGTACAGGG 

610 840 
TGTGGCAGTT TGCCCGGCTG GCATGGTGGG CAGTGGTGAT GCAAATGCTG GGGGCGCCCA 
ACACCGTCAA ACGGGCCGAC CGTACCACCC GTCACCACTA CGTTTACGAC CCCCGCGGGT 
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870 900 



TGGCAAATCT CCTAGTCTTC ATGGCTGCAG CCCCAATCTT GTCAGCATTC CGCCTCTTCT 
ACCGTTTAGA GGATCAGAAG TACCGACGTC GGGGTTAGAA CAGTCGTAAG GCGGAGAAGA 



930 960 



ACTTCGGCAC TTACCTGCCA CACAAGCCTG AGCCAGGCCC TCCAGCAGGC TCTCAGGTGA 
TGAAGCCGTG AATGGACGGT GTGTTCGGAC TCGGTCCGGG ACGTCGTCCG AGAGTCCACT 



990 1020 



^ 9 W 

TGGCCTGGTT CAGGGCCAAG ACAAGTGAGG CATCTGATGT GATGAGTTTC "GACATGCT 
ACCGGACCAA GTCCCGGTTC TGTTCACTCC GTAGACTACA CTACTCAAAG GACTGTACGA 



ioso 1080 



ACCACTTTGA CCWCACTGG GAGCACCACA GGTGGCCCTT TGCCCCCTGG TGGCAGCTGC 
TGGTGAAACT GGACGTGACC CTCGTGGTGT CCACCGGGAA ACGGGGGACC ACCGTCGACG 



1110 1140 



CCCACTGCCG CCGCCTGTCC GGGCGTGGCC TGGTGCCTGC CTTGGCATGA CCTGGTCCCT 

Sctgacggc ggcggacagg cccgcaccgg accacggacg gaaccgjact GGACCAGGGA 

1170 D 1200 

CCGCTGGTGA CCCAGCGTCT GCACAAGAGT GTCATGCTAC AGGGTGCTGC GGCCAGTGGC 
GGCGACCACT GGGTCGCAGA CGTGTTCTCA CAGTACGATG TCCCACGACG CCGGTCACCG 



1230 1260 



AGCGCAGTGC ACTCTCAGCC TGTATGGGGC TACCGCTGTG CCACTGAGCA CTGGGCATGC 
icGCGTScG TGAGAGTGGG ACATACCCCG ATGGCGACAC GGTGACTCGT GACCCGTACG 



1290 1320 



CACTGAGCAC TGGGCGTGCT ACTGAGCAAT GGGCGTGCTA CTGAGCAATG GGCGTGCTAC 
GTGACTCGTG ACCCGCACGA TGACTCGTTA CCCGCACGAT GACTCGTTAC CCGCACGATG 



1350 1380 



TGACAATGGG CGTGCTACTG GGGTCTGGCA GTGGCTAGGA TGGAGTTTGA TGCATTCAGT 
ACTGTTACCC GCACGATGAC CCCAGACCGT CACCGATCCT ACCTCAAACT ACGTAAGTCA 



1440 

1410 J 



AGCGGTGGCC AACGTCATGT GGATGGTGGA AGTGCTGAGG GGTTTAGGCA GCCGGCATTT 
TCGCCACCGG TTGCAGTACA CCTACCACCT TCACGACTCC CCAAATCCGT CGGCCGTAAA 



1470 1500 



GAGAGGGCTA AGTTATAAAT CGCATGCTGC TCATGCGCAC ^ATCTGCAC ACAGCCAGGG 
CTCTCCCGAT TCAATATTTA GCGTACGACG AGTACGCGTG TATAGACGTG TGTCGGTCCC 



1530 1560 



AAATCCCTTC GAGAGTGATT ATGGGACACT TGTATTGGTT TCCTOCTATT GTTTTATTCA 
TTTAGGGAAG CTCTCACTAA TACCCTGTGA ACATAACCAA AGCACGATAA CAAAAlAi^i 



1590 1620 



GCAGCAGTAC TTAGTGAGGG TGAGAGCAGG GTGGTGAGAG TGGAGTGAGT GACTATGAAC 
CGTCGTCATG AATCACTCCC ACTCTCGTCC CACCACTCTC ACCTCACTCA CTCATACTTG 



1650 1677 



CTGGTCAGCG AGGTGAACAG CCTGTAATGA ATGACTCTGT CTAAAAAAAA AAAAAAA 
GACCAGTCGC TCCACTTGTC GGACATTACT TACTGAGACA GATTTTTTTT TTTTTTT 
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10 20 30 40 50 60 

CGGGGCAACT CAAGAAATTC AACAGCTGCA AGCGCGCCCC AGCCTCACAG CGCCAAGTGA 

70 BO 90 100 HO 120 

GCTATCGACG TGGTTGTGAG CGCTCGACGT CGTCCACTGA CGGGCCTGTG ^CCTCTGCG 

130 140 150 160 Tito 180 

CTQCGTC^TC TGCCAAATCT CGCGTCGGGG CCTGCCTAAG TCGAAG>£l3 CACGTCGCAT 
"^ 27 T 200 210 220 230 240 



CGGCACTaEE&TCGAGCAG AAAGGCAGTG AGGCAGCTCjC TTCCAGCCCA GACGTCTTGA 
250 260 Y 270 280 31 290 300 



pAGCGTGGGC GACACAGTAT CACgpXAT CCGAGTCGTC AGACGCAGCT CGTCCTGCGC 
310 320 



^37 ... _ ^ 2 330 340 350 360 



310 J^U oju ~ ~ _______ 

TAAAGCACGC CTACAAACCT CCAGCATCTG ACGCCAAGGG CATCACGgTCl GCGCTGACCA 

Vlo 380 390 400 410 420 

TCATTGGCAC CTGGACCGCA GTGTTTTTAC ACGCAATATT TCAAATCAGG CTACCGACAT 

430 440 450 460 470 480 

CCS33ACCA GCTTCACTGG TTGCCTGTGT CCGAAGCCAC AGCCCAGCTT TTGGGCGGAA 
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